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REMARKS 

Claims 27, 29-42, 44-47, and 49-55 are pending in the present application. 
Reconsideration is respectfully requested in view of the following comments and the 
amendments above. 

The rejection of Claims 27-29, 35, 39, 43, 44, and 48 under 35 U.S.C. §112, first 
paragraph (written description; new matter), is obviated by the present amendment. 

Applicants make no statement in regard to the propriety of this ground of rejection 
and in no way acquiesce to the same. However, in order to expedite examination of this 
application, Applicants have deleted the phrase "all or part of from Claim 27. Therefore, 
this ground of rejection is now believed to be moot. 

Withdrawal of this ground of rejection is requested. 

The rejection of Claims 27-29, 35, 39, 43, 44, and 48 under 35 U.S.C. §112, first 
paragraph (written description and enablement), is obviated by the present amendment. 

In regard to the Examiner's criticisms of the claims in relation to SEQ ID NO: 11, 
Applicants note that this issue is now moot in view of the deletion of this sequence by 
amendment herein. 

Turning to the claims as currently presented, Applicants wish to offer the following 
remarks. In particular, to address the Examiner's assertion that "extensive random 
unpredictable experimentation would be required to determine useable peptide molecules for 
a given purpose," Applicants offer the following: 

MPEP §2164.04 states: 

A specification disclosure which contains a teaching of the manner 
and process of making and using an invention in terms which 
correspond in scope to those used in describing and defining the 
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subject matter sought to be patented must be taken as being in 
compliance with the enablement requirement of 35 U.S.C. 1 12, 
first paragraph, unless there is a reason to doubt the objective truth 
of the statements contained therein which must be relied on for 
enabling support. 

Applicants submit that the present specification provides ample description and 
support for the skilled artisan to produce polypeptide molecules containing at least 50 
consecutive amino acids of SEQ ID NO:3, since the amino acid sequence of SEQ ID NO: 3 is 
provided in the present specification. Further, the skilled artisan may, without undue 
experimentation, easily synthesize at least 50 consecutive amino acids according to the 
methods described at least on page 1 1 of the specification; i.e., by the methods of Houber- 
Weyl or R.D. Merrifield. These methods have been known in the art for at least 3 1 years! 

Following synthesis, the particular peptides can be tested for their ability to confer 
immunity against malaria as described throughout the Examples in the present specification. 
More particularly, the skilled artisan would initially, after synthesis, try to test the peptides in 
mice as illustrated in Example 6 of the specification. After initial screening, the peptides can 
be fiirther tested in chimpanzees as exemplified in Examples 2 (2.1 to 2.5), 4 (4.1 to 4.4) and 
7. Thus, there are numerous working examples in the present specification that would guide 
the person skilled in the art to test for immunogenicity. 

Furthermore, the specification provides additional guidance with respect to the 
particular peptides that should be chosen. For example, the specification describes that B and 
T responses are important and have been identified in the non repeat proteins of the 5' and 3' 
regions of LSA-3 (page 19). At least Example 3 of the specification describes ways to 
identify CTL epitopes and that observed protection that depended on cellular responses is 
important for a vaccine. 
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Moreover, on page 35 the specification describes that inhibitory activity is displayed 
in the development of Plasmodium falciparum with interferons. Thus, the person skilled in 
the art can also test to determine whether the peptides have this type of activity. 

Thus, Applicants submit that the specification provides enough guidance to permit the 

skilled artisan to obtain and test the polypeptide molecules containing at least 50 consecutive 

amino acids of SEQ ID NO: 3 without undue experimentation and, as such, satisfy the 

requirements of MPEP §2164.04. 

Applicants caution that the quantity and complexity of testing should not be a criteria to 

render a rejection under enablement. As set forth in Massachusetts Institute of Technology v. 

AB Fortia, 774 F. 2d 1 104, 227 USPQ 428 (Fed. Cir. 1985): 

[T]he fact that experimentation may be complex. . .does not necessarily 
make it undue, if the art typically engages in such experimentation. 

In fact, MPEP §2164.06 states: 

. . . quantity of experimentation needed to be performed by one skilled in 
the art is only one factor involved in determining whether "undue 
experimentation" is required to make and use the invention, "[A]n 
extended period of experimentation may not be undue if the skilled artisan 
is given sufficient direction or guidance." In re Colianni, 561 F.2d 220, 
224, 195 USPQ 150, 153 (CCPA 1977). " The test is not merely 
quantitative, since a considerable amoimt of experimentation is 
permissible, if it is merely routine, or if the specification in question 
provides a reasonable amount of guidance with respect to the direction in 
which the experimentation should proceed. 

Applicants submit that, with the present specification in hand, determination of 
polynucleotide sequences that fall within the scope of the present invention would require 
nothing more than routine experimentation, even though the protocols may appear to be 
complex. As such, Applicants submit that the claims of the present application are fully 
enabled within the context of 35 U.S.C. §112, first paragraph. 

Based on the foregoing. Applicants submit that the present claims are fully described 
and enabled by the specification, especially when viewed in conjunction with the common 
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knowledge available in the art. As such, withdrawal of these grounds of rejection is 
requested. 

The rejection of Claims 27-29, 35, 39, 43, 44 and 48 under 35 U.S.C. §112, second 
paragraph, is obviated by amendment. 

Applicants have amended the claims to be free of the Examiner's criticisms. More 
specifically. Claim 27 has been amended to recite at least 50 amino acids of SEQ ID NO: 3. 
It should be clear from the language in this claim, that any polypeptide molecule having one 
or more polypeptides of SEQ ID NOS: 10, 12, 13, 14 and 15 are in fact excluded from the 
claim language. Since "all or part" has been deleted from this claim, it is clear that peptides 
shorter than at least 50 consecutive amino acids are not encompassed by the claim. 
Moreover, Claim 35 has been amended to ensure proper antecedent basis. 

In view of the amendments herein, withdrawal of this rejection is respectfiilly 
requested. 

The rejection of Claims 27-29 under 35 U.S.C. § 102 (a) over Barnes et al is 
traversed. 

The Examiner asserts that the argument filed on September 3, 2004, has been 
considered but is not persuasive. The Examiner further asserts, "Applicant cannot rely upon 
the foreign priority claim to overcome this rejection because the requirements of 35 U.S.C. 
§119 have not been met in. . . the instant application. The certified copy of the priority 
document is absent." 

Applicants submit that this assertion by the Examiner is incorrect in view of the 
record of the present application. On December 22, 2000, Applicants filed a Request for 
Priority (copy submitted herewith), in which Applicants claimed priority to FR9507007. 



14 



Application No. 09/742,096 

Reply to Office Action of December 3, 2004 

Further, in the Request for Priority, Applicants noted that a certified copy of the same was 
submitted to the International Bureau in PCT application PCT/FR96/00894. It was further 
noted that receipt of the certified copy by the Intemational Bureau in a timely manner under 
PCT Rule 17.1(a) was acknowledged as evidenced by PCT/IB/304 (copy submitted 
herewith). 

Applicants again note that Barnes et al was published in August 1995, whereas this 
application claims priority to French application 95/07007 filed on Jime 13, 1995. Since the 
certified copy of FR9507007 was timely submitted to the Intemational Bureau in PCT 
application PCT/FR96/00894, a certified English translation of this French application was 
filed in the parent application (US 08/973,462), and a certified English translation of this 
French application was again filed with the response on September 3, 2004 (copy of the date- 
stamped filing receipt is submitted herewith), the Examiner's rejection is not proper. 
Nonetheless, in order to ensure that no further errors occur in this regard. Applicants enclose 
herewith another copy of the certified English translation of the French priority document 
(FR9507007). Accordingly, Applicants submit that Barnes et al should not be prior art and 
this ground of rejection should be withdrawn. 

Acknowledgment of withdrawal of this rejection is respectfully requested. 

The rejection of Claims 27-29, 35, 39 and 43 under 35 U.S.C. § 102 (f), is respectfully 
traversed. 

In rendering this rejection, the Examiner deems that since U.S. Patent Nos. 
6,319,502 Bl and 6,270,771 have different inventors than the presently claimed invention 
and describe similar subject matter, the presently claimed inventors did not contribute to the 
above-captioned invention. Applicants disagree with the Examiner's conclusions for the 
reasons already of record. 
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It should be noted that Pierre Druihle is a common inventor for both the U.S. issued 
patents and the present invention. Furthermore, U.S. Patent Nos. 6,270,771 Bl and 
6,319,502 do not disclose the entire LSA-3 sequence, which is the subject of the present 
application. Moreover, the partial LSA-3 sequence disclosed in these issued patents is 
furthermore excluded from the claims. 

To further evidence that the present invention was not derived from Ms. Claudine 
Guerin-Marchand, Applicants provide the enclosed Rule 132 Declarations executed 
independently by Pierre Druilhe (the Druilhe Declaration) and Pierre Daubersies (the 
Daubersies Declaration). In the Druilhe Declaration, the history of the joint invention of Ms. 
Claudine Guerin-Marchand and Mr Pierre Druilhe appearing in U.S. Patent Nos. 
6,319,502 Bl and 6,270,771 is described. Also in the Druilhe Declaration, Mr. Pierre 
Druilhe describes the role of the inventors of the present application. Further, in the 
Daubersies Declaration, Mr. Pierre Daubersies describes his role in the invention claimed in 
the present application. Therefore, Applicants submit that the presently claimed invention 
was not derived from Ms. Claudine Guerin-Marchand. 

In view of the foregoing, withdrawal of this rejection is respectfully requested. 

The rejections of (a) Claims 27-29, 35, 39 and 43 under 35 U.S.C. § 102 (b) over 
Guerin-Marchand et al (W092/13884) in the light of the translation provided in U.S. Patent 
No. 6,270,771, and (b) Claims 27-29, 35, 39 and 43 under 35 U.S.C. § 102 (e) over Guerin- 
Marchand (U.S. Patent No. 6,319,502) are respectfully traversed. 

U.S. Patent No. 6,270,771 discloses various sequences from LSA-1, which have the 
following repetitive sequence: 

X1DLEQX2RX3AKEKLQX4QQQ 
where Xi is Ser or Arg 
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X2 is Glu or Asp 
X3 is Arg or Leu 
X4 is Glu or Gly. 

This sequence is not present in SEQ ID NO: 3 of the present invention. Thus, even 
sequences which are longer that include the above sequence are not claimed in the present 
invention. 

The other sequences described in U.S. Patent No. 6,270,771 are derived from the 
clone DG729S. The 729Sprotein is described at column 5 as SEQ ID No:24 and has 151 
amino acids. As pointed out by the Examiner, SEQ ID NO: 1 1 of the present invention is not 
disclosed in SEQ ID NO: 3. The remaining sequence of SEQ ID NO: 24 in the prior art is 
excluded from the presently claimed invention as SEQ ID NO: 10. 

The remaining sequences in U.S. Patent 6,270,771, which are described are SEQ ID 
NO: 25, which is the equivalent sequence to excluded SEQ ID NO: 12 of the present 
invention; SEQ ID NO:26, which is the equivalent excluded sequence of SEQ ID NO: 13 of 
the presently claimed invention; SEQ ID NO: 27, which is the equivalent to the excluded 
SEQ ID NO: 14 of the presently claimed invention; and SEQ ID NO: 28, which is the 
equivalent to the excluded SEQ ID NO: 15 of the presently claimed invention. 

Furthermore, SEQ ID NOS: 25 to 28 have amino acid sequences that are less than 50 
consecutive amino acids. As pointed out by the Examiner, U.S. Patent 6,270,771 discloses 
that the invention encompasses "any sequence comprising 4 to 5 amino acids up to the 
maximal number of the sequences described above." Since, SEQ ID NOs. 25 to 28 have 47, 
26, 27 and 27 total amino acids, these sequences cannot be said to encompass a polypeptide 
molecule having at least 50 consecutive amino acids of SEQ ID NO: 3 of the present 
invention. 
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Therefore, U.S. Patent No. 6,270, 771 fails to disclose or suggest the presently 
claimed invention. In order for a reference to anticipate an invention, the reference "must 
teach every element of the claim" (MPEP §2131). Accordingly, U.S. Patent No. 6,270, 771 
does not anticipate the invention as presently claimed. 

For the same reasons as stated above, the presently claimed invention is not 
anticipated by U.S. Patent 6,319, 502, v^hich is a divisional of U.S. 6,270,771. Since both 
issued patents have the same specification, U.S. Patent 6,319, 502 does not anticipate the 
claimed invention. In summary U.S. Patent 6,319,502 teaches sequences that are not 
presently being claimed, are excluded from the claims or are not of sufficient length to 
anticipate the presently claimed invention. 

Therefore, in view of the above, withdrawal of these rejections is respectfully 
requested. 

Finally, Applicants remind the Examiner that MPEP §821.04 states: 

. . .if applicant elects claims directed to the product, and a product 
claim is subsequently found allowable, withdrawn process claims 
which depend from or otherwise include all the limitations of the 
allowable product claim will be rejoined, (emphasis added) 

Accordingly, should the elected invention be found allowable. Applicants request that 

withdrawn process claims be rejoined and examined. 
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Applicants submit that the application is in condition for allowance. Early notice to 
this effect is earnestly solicited. 

Respectfully submitted, 

OBLON, SPIVAK, McCLELLAND, 
MAIER & NEUSTADT, P.C. 
Norman F, Obion 




Vincent K. Shier, Ph.D. 
Registration No. 50,552 



Customer Number 

22850 

Tel: (703)413-3000 
Fax: (703)413-2220 

(OSMMN 08/03) 
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PATENT 

Attorney Docket No.200773USODiv 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Patent Application of 



Pierre DRUILHE etal 



Serial No. :09/742,096 



Filed: Decennber 22, 2000 



Group Art Unit: 1641 



Examiner: James Leslie GRUN 



For: MALARIAL PRE-ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 



1.132 Declaration 



Commissioner for Patents 
P.O. Box 1450 

Alexandria. Virginia 22313-1450 



I, Pierre Daubersies do hereby declare the following: 

(1) I Am currently working in the Research and Development 
Department at Merck LIPHA SANTE in France. Prior to being 
employed by Merck. I worked in the Laboratory of Parisitology 
at Institut Pasteur under the head of the Department. Mr. 
Pierre Druilhe. During the years of 1992 to 1994, I was a 
post-doctoral Fellow in Mr. Druihie's laboratory working on a- 
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project to clone, sequence and characterize LSA-3. My 
Curriculum Vitae is enclosed. I am one of the inventors of the 
above-captioned U.S. patent application. 



(2) In the present invention the entire /sa-3 gene was cloned and 
sequenced, which was the goal of my post-doctorate studies.. 
To clone and sequence the entire lsa-3 gene was in fact quite 
difficult since the genome of Plasmodium falciparum is very rich 
in A: T bases (80%), thus there is a rarity of restriction sites 
which can be used and also instability of cloning certain 
fragments when they are inserted into plasmid vectors. Thus, it 
took further ingenuity to obtain full length LSA-3, analyze its 
sequence, as well as to determine whether there was any 
polymorphism in different isolates, which is generally the case 
between different clones, strains and isolates of Plasmodium, as 
well as to determine the antigenicity in chimpanzees of LSA-3 
and fragments thereof. 

(3) Secondly, I am one of the true inventors of the claimed subject 
matter of the present patent application and did not derive the 
claimed subject matter from Ms. Claudine Guerin-Marchand, who 
at that epoch was working in the same laboratory. Although Ms. 
Claudine Guerin-Marchand did obtain a clone 729S while 
working under the supervision of Professor Druilhe. this was only 
a 151 amino acid antigen of LSA-3, which is a short fragment. 
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(4) I further declare that all statements made herein of my own knowledge are 
true and that all statements made on information and belief are believed to be true; 
and further that these statements were made with the ? knowledge that willful false 
statements and the like so made are punishable:eby fine or imprisonment, or both, 
under Section 1001 of Title 18 of the United States Code and that such willful false 
statements may jeopardize the validity of this application or any: patent issuing 
thereon. 




Date Pierre Daubersies 
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PATENT 

Attorney Docket No.200773USODiv 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re Patent Application of 



Pierre DRUILHE et al 



Serial No.:09/742,096 



Filed: Decennber 22, 2000 



Group Art Unit: 1641 



Examiner: James Leslie GRUN 



For: MALARIAL PRE-ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 



1.132 Declaration 



Commissioner for Patents 
P.O. Box 1450 

Alexandria. Virginia 22313-1450 



I, Pierre Druiihe do hereby declare the following: 



(1 ) I am currently the director of the Biomedical Parasitology Unit at Institut 
Pasteur in Paris, France and have been the director of this department 
for many years. As can be seen from my attached Curriculum Vitae, I 
have published 260 articles and have 10 U.S. Patents issued in my 
name. I am one of the inventors of the above-captioned U.S. patent 
application. 
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(2) The primary scientific interest in my laboratory is focused on the 
understanding of Plasmodium falciparum/human immune reactions, 
both at the level of the pre-erythrocytic stages (sporozoite and liver 
stage), as well as the merozoite stage. The main activities with these 

. scientific studies in my laboratory are aimed at vaccines to treat or 
prevent malaria, as well as drug discovery in this area. 

(3) As head of a laboratory at Institut Pasteur, I supervise and am 
responsible for many researchers, scientific trainees, as well as other 
personnel such as laboratory technicians and office staff. My 
laboratory also collaborates on particular scientific projects with various 
scientific groups not associated with Institut Pasteur. 

(4) I have read and understood the last U.S. Official Action dated 
December 3, 2004. As I understand, the Examiner has rejected claims 
27 to 29. 35, 39 and 43 on the basis that Mr. Pierre Daubersies and I 
did not invent the subject matter which is currently claimed and sought 
to be patented. It is my further understanding that the rationale behind 
this rejection is that issued U.S. Patents 6,270,771 B1 and 6. 319,502 
B1 have myself and Claudine Guerin-Marchand as inventors, while in 
the above present patent application the other inventor is Pierre 
Daubersies. Since a partial sequence of a 729S protein was disclosed 
in issued patents 6,270,771 and 6,319,502, the Examiner maintains 
that the present inventorship is not correct. I disagree with the 
Examiner's conclusions for the following reasons. 

(5) U.S. Patent Nos. 6,270,771 and 6,319,502 are based on the discovery 
and sequencing of peptides from LSA-1 (liver stage antigen 1). At that 
epoch and as described in those patent specifications, one clone that 
was derived from our extensive search of the genomic library of 
Plasmodium falciparum was directed to a protein different from LSA-1 ; 
i.e.. LSA-3 clone DG729S. The sequencing of this clone revealed a 
polypeptide of 151 amino acids, which is only a partial sequence of the 
LSA-3 protein. This scientific work was in fact the conception of myself 
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and Ms. Claudine Guerin-Marchand, who was a researcher in my 
laboratory at that time. After appropriate evaluation by Institut 
Pasteur's Patent and Invention Office, it was decided that Claude 
Guerin-Marchand and myself were inventors of at least one claim for 
U.S. Patent 6,270.771 and 6.319.502. 

(6) In contrast, in the present invention the entire lsa-3 gene was cloned 
and sequenced. To clone and sequence the entire /sa-3 gene was in 
fact quite difficult since the genome of Plasmodium falciparum is very 
rich in A: T bases (80%), thus there is a rarity of restriction sites which 
can be used and also instability of cloning certain fragments when they 
are inserted into plasmid vectors. Thus, it took further ingenuity to 
obtain full length LSA-3. analyze its sequence, as well as to determine 
whether there was any polymorphism in different isolates, which is 
generally the case between different clones, strains and isolates of 
Plasmodium, as well as to determine the antigenicity in chimpanzees of 
LSA-3 and fragments thereof. 

(7) In this respect, a post-doctoral fellow, Mr. Pierre Daubersies undertook 
the project to clone LSA-3 with myself. After appropriate evaluation by 
Institut Pasteur's Patent and Invention Office, it was concluded that 
myself and Mr Duabersies were considered inventors for at least one 
claim in the present patent application. 

(8) I am one of the true inventors of the claimed subject matter of the 
present patent application and did not derive the invention from Ms. 
Claudine Guerin-Marchand. 
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(9) I further declare that all statements made herein of nny own 
knowledge are true and that all statements made on information and 
belief are believed to be true; and further that these statements were 
made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 
1001 of Title 18 of the United States Code and that such willful false 
statements may jeopardize the validity of this application or any patent 
issuing thereon. 



Date 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

IN RE APPLICATION OF: Pierre DRUILHE et aL GAU: 
SERIAL NO: NEW APPLICATION EXAMINER: 
EREWITH 

:.ARIAL PRE-ERYTHROCYTIC STAGE POLYPEPTIDE MOLECULES 

REQUEST FOR PRIORITY 

MISSIONER FOR PATENTS 
,D.C. 20231 

SIR: 

■ Full benefit of the filing date of U.S. Application Serial Number 08/973,462, filed February 6, 1998, is claimed pursuant to 
the provisions of 35 U.S.C. §120. 

O Full benefit of the filing date of U.S. Provisional Application Serial Number , filed , is claimed pursuant to the provisions of 
35 U.S,C. §1 19(e). 

■ Applicants claim any right to priority from any earlier filed applications to which they may be entitled pursuant to the 
provisions of 35 U.S.C. §119 and §120, as noted below. 

In the matter of the above-identified application for patent, notice is hereby given that the applicants claim as priority: 

rOfJNTRY APPI.TCATTON NUMBER MONTH/DAY ATT. AR 

France 95/07007 June 13, 1995 

WIPO PCT/FR96/00894 June 12, 1996 



Certified copies of the corresponding Convention Application(s) 

□ are submitted herewith 

□ will be submitted prior to payment of the Final Fee 

□ were filed in prior application Serial No. filed 

■ were submitted to the International Bureau in PCT Application Number PCT/FR96/00894. 

Receipt of the certified copies by the International Bureau in a timely manner under PCT Rule 17.1(a) has been 
acknowledged as evidenced by the attached PCT/IB/304. 

□ (A) Application Serial No.(s) were filed in prior application Serial No. filed ; and 
(B) Application Serial No.(s) 

□ are submitted herewith 

□ will be submitted prior to payment of the Final Fee 
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Respectfully Submitted, 



K, McClelland, 

T, P.C. 




Norman F. Obion 
Registration No. 24,6 1 8 

James J. Kelly, Ph.D. 
Registration No. 4 1 ,504 
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Expediteur : le BUREAU INTERNATIONAL 



PCT 

NOTIFICATION RELATIVE A LA PRESENTATION 
DU DOCUMENT PRIDRITP 

(instruction administrative 411 du PCT) 


Destinataire: 

GUTMANN, Ernest 
Yves Plasseraud S.A. 
3, rue Chauveau-Lagarde 
F-75008 Paris 
FRANCE 


Date d*expedition (jour/mots/a nnee) 
24 juillet 1996 (24.07.96) 


Reference du dossier du deposant ou du mandataire 
B2775A-FL 


NOTIFICATION IMPORTANTE 


Demande intemationale no Date du depot international Date de priorit6 

PCT/FR96/00894 12 juin 1996 (12.06.96) 13 juin 1995 (13.06.95) 


Deposant 

INSTITUT PASTEUR etc 



La date de reception par le Bureau international du ou des documents de priorite correspondant ^ ta ou aux demandes suivantes 
est notifiee au deposant: 



Demande anterieure no: Date de priorite: 

95/07007 13 Jul 1995 (13.06.95) 



Pays dans lequel ou pour lequel Date de reception du 

la demande a ete deposee: document de priority 



FR 



23 jui 1996 (23.07.96) 




Bureau International de TOMPI 
34, chemin des Colombettes 
121 1 Geneve 20, Suisse 

no de telecopieur: (41-22) 740.14.35 



Formula ire PCT/IB/304 Ouillet 1992) 



Fonctionnaire autorise: 

G. Bahr 

node telephone: (41-22) 730.91 . 1 1 




VERIFICATION OP A TRANSLATION 




I» the below named translator, hereby declare that: 
My name and post office address are as stated below: 

That I am knowledgeable in the English language and in the language in iTvliich the below 
identified French Application was filed, and that I believe the English translation of the 
French Application N*" 9507007 filed on 13/06/95 in the name of Ihstitut Pasteur 

is a true and complete translation of the above identified French Application as filed. 



Date: 



Full name of the translator 



Vdronique MARCAD6 



Signature of the translator 




post Office Address 



3, rue Chauveau-Lagarde 



F.75008 Paris 
FRANCE 



POLYPEPTIDE MOLSCULSS OF THE PRE -ERYTHROCYTIC STAGE OF 

The parasites responsible for malaria in man 
display different morphologies in the hxunan host and 
expresd different antigens depending on their location 
in the body. The morphological and antigenic 
differences of these parasites during their life cycles 
in man enable different stages of development in the 
liver and in the blood to be defined; the aporozoite, 
the infectious form injected by the vector mosquito, 
transforms rapidly into a schizont in the host's 
hepatocytes and thereafter infects the erythrocytes . 
The intrahepatic localization of P , f alcipaarum manifests 
itself in the expression of a group of antigens 
specific to this stage of development and which are 
highly immunogenic under the natural conditions of 
exposure to the disease. This clinically silent phase 
is at present the only one against which a very strong, 
sterilizing immunity can be induced experimentally in 
man, by injecting irradiated sporozoites capeible of 
entering the hepatocyte and of developing therein but 
without being able to lead on to the blood stage of the 
disease. Accordingly, the inventors have concentrated 
the bulk of their efforts on these two pre -erythrocytic 
stages. However, these stages are also the most 
intricate ones to study, and hence the least under- 
stood, since it is difficult or even impossible to 
obtain biological material, the only in vitro study 
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model affords a very lo%^ yield and the best animal 
model remains the chimpanzee, the use of which is 
limited and es^enaive. 

In order to gain access to the antigens of the 
5 pre-erythrocytic stages , the inventors used sera of 
individuals who had resided for 25 years in a region 
where the disease is endemic but who were on permanent 
prophylaxis with chloroquine- These individuals were 
regularly subjected to infected mosquito bites but did 
10 not develop any complete blood infection. Their serum 
hence contained antibodies directed essentially against 
the pre^erythrocytic stages, which was verified by 
irrnnunof lucre scence (IP) and western blotting on the 
3 stages of the parasite . 

15 The use of these sera for screening a library 

of genomic DNA of the parasitic clone of P, falciparum , 
the library being constructed in expression vectors in 
a phage lambda gtll (V. Rosario, Science 212 , 1981, 
pp. 1037-1038; and Thaithong et al - , Transactions of 

20 Royal Society of Tropical Medicine and Hygiene, 1984, 
78s242-245), led to the demonstration of polypeptides 
of the pre -erythrocytic stage, in particular the SAliSA 
(sporozoite liver stage antigen) polypeptides described 
in BP A-0, 407,230 and LSA-l (liver stage antigen) 

25 described in WO 92/13884. The present invention relates 
to new polypeptide molecules specific to the pre- 
erythrocytic stage, and to their use as active 
principle of antimalarial vaccine or in methods of 
diagnosis of the disease. 

30 The invention is the outcome of the 

demonstration by the inventors of the special 
properties of a particular antigen referred to LSA-3 
and of its fragments, trfhich are seen to be candidates 
tf^ith a strong potential for producing an antimalarial 

35 vaccine, for the following reasons ^ 

a) when a fraction of IjSA-3 was used in combination 
with another antigen of the same stage of development 
of the parasite, such as LSA-1, to immunize 
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chimpanzees, the animal responding to both molecules or 
only to IjSA-3 displays the feature of not having 
parasites in the bloody of having a substantial 
decrease of the parasites in the liver and of manlfest- 
5 ing a substantial recruitment of mononuclear cells 
indicating a response in terms of cellular immunity; 

b) in regions T;^here the disease is endemic, a very 
clear correlation is observed between the protection of 
individuals against natural infection by sporozoites 

10 and their responses in terms of antibodies against 
IiSA-3; 

c) in eight human volunteers immunized by injection of 
irradiated sporossoites, antibodies directed against 
LiSA-3 are found in each of the four individuals 

IS resisting sporozoite infection and in none of the other 
four volunteers who developed a blood infection « 

d) antibodies obtained against the peptide DG729 in 
WO 92/13884, already described, give a cross -react ion 
with the sporozoite and liver stages of tlhe murine 

20 parasite P.yoelii ^ which permit© a significant 
e^qploitation of the mouse model. In vitro , the human 
antibodies immunopurif ied on DG729 are capable, even at 
very low concentrations, of blocking the entry of 
P^yoelii sporozoites into mouse hepatocytes » In vivo , 

25 mice immunized with DG729 are fully or partially 
protected against infection by P ,j^oel i i sporozoites; 

e) lastly, some epitopes, in particular in the non- 
repeat portions of the molecule^ stimulate the 
secretion of interferon-y by monocytes ^ this mediator 

30 enabling the intrahepatic development of the parasite 
to be inhibited (S, Mellouk et al o , The Jour, Of Immun. 
139 , 4192-4195, 1987); 

All these properties, some of fefhich will be 
demonstrated in detail in the experiments described 
35 later, shov^ that the LSA-3 antigen displays both good 
antigenicity and good immunogenicity. 

The inventors were able to confirm and define 
the specificity of the stages of expression of the 



molecule; in the sporozoites, this esqpreseioix was 
confirmed by the surface immuyiof luorescence of several 
stsraimia and isolates. In !i?eatesm Blot analysis, the 
molecule appears as a protein of molecular weight 
200,000 daltons. While the messenger RN^s of 
sporozoites could not be obtained in sufficient a?nounts 
for a northern blot analysis, reverse PGR eacperiments 
confirmed the expression of IiSA-3 at this stage. In 
infected hepatocytes, is observed in the 

parasitophorous vacuole of the parasite by immuno- 
fluorescence using antibodies against the repeat and 
non-repeat regions of the protein, as well as by 
electron microscopy. 

A fragment of designated 72 9S, as v^&ll as 

three peptides designated NRI and NRII included in the 
non-repeat portion and 729R included in the repeat 
portion, have been described in Application 
WO 52/13884 • Nevertheless, this document does not 
mention the special properties mentioned above, or 
other fragments of LSA-S lafhich could be either longer 
or shorter, included or contdoined with these fragments, 
which might display especially advantageous properties 
for use in vaccines o 

The siyibject of the invention is polypeptide 
molecules containing at least ten consecutive amino 
acids of the amino acid sequence shown in Figure 2 and 
designated SEQ ID NOo 2, and representing LiSA-3, the 
following polypeptides being excluded s 

- RDELFNELmSVDWGEVKENILEESQVm^DIFNSLVKSVQQEQQHNV^ 

- VEESVEENDEES VEENVEEE^VENNDDGS VASSVEES lASSVDBS IDSS I E - 
KWAPTVEEIVAPTVEEIVAPSVVEKCAPSVEESVAPSVEESVAEB«LjKER 
<72 9S) 

- RDELF^LI^SVDVNGEVKENILEESQVNODIFNSLVKSVQQEQQH^ 

- DELFNELLNSVDVNGEVKENIIjEESQ, iimi) 

- LEESQVNDDIFSNSIjVKSVQQEQQHW, (NRII) 

- VESVAPSVEESVAPSVEESVAEWVEESV- (729RE) 



other molecules according to the invention 
contain at least 2 0 consecutive amino acids or at least 
SO. 

This set of polypeptides and the LSA-3 molecule 
are, throughout hereinafter, ^'polypep tides of the 
invention" . 

The experimental results and the comparisons of 
sequences between different P, falciparum isolates 
indicate the existence of at least 70% homology between 
equivalent antigens of the liver stage of the parasite. 
Thus any peptide molecule displaying at least 70% 
homology with any one of the molecules defined eibove 
forms part of the invention, as do those displaying at 
least 70% homology with the following sequence: 
Leu Leu Ser Asn lie Glu Glu Pro Lys Glu Asn lie lie Asp 
Asn Leu Leu Asn Asn lie (CTl) 

lying between amino acids 140 and 159 of Kl oir 23 and 
42 of T9/96. 

Likewise forming part of the invention are the poly- 
peptide molecules displaying at least 70% homology with 
the sequence depicted in Figure 3, which depicts a 
portion of LSA-3 in T9/96s the DNA of this P„ falciparum 
isolate was digested with restriction enzymes, then 
cloned into lambda gtll and thus enabled the gene 
library of this isolate, already described above, to be 
constituted. 

Conjugates consisting of a polypeptide 
originating from LSA*3 linked covalently via a lysine 
bridge to saturated or unsaturated lipid residues also 
form part of the invention, more especially when the 
lipid residue is a palmitoyl or a palraityl or an oleyl* 
Ci6 or Cid residues were thus coupled via a lysine bridge 
to the peptides NRI, NRII, 729RE and CTl already 
depicted above. The method of synthesis used for these 
conjugates is described in Bourgault, LToumal of 
Immunology, 149, 3416 (1992) and Rouaix, Vaccine, 12, 
1209 (1994) , 



The invention also covers immunogenic composi- 
tions containing at least one polypeptide tt»lecule or 
one conjugate described above, as well as the vaccines 
containing these immunogenic compositions. Other 
5 immunogenic epitopes, in particular LSA^l, SALSA, 
STARP, have already been described in EP A- 04 07230 and 
in WO 92/13884 o The vaccine con^ositions according to 
the invention can advantageously contain a mixture of 
immunogenic peptides originating from LSA-3 and of the 

10 peptides or antigens originating from LSA-1, SALSA or 
STARP; a more especially advantageous mixture could be 
the one consisting, on the one hand of NRI, MRU or 
whole LSA-3, these being coupled or otherwise to a 
lipid residue, and on the other hand the peptides 

15 SAIiSA-1, SALSA-2 or the SALSA antigen coupled or 
otherwise to a lipid residue . 

All polypeptide molecules corresponding to the 
above definition and displaying at least 70S homology 
with the polypeptides LSA-3, CTl, MRI, mil or 729RE 

20 may be combined in homologous or heterologous fashion 
with other peptide seq^uences or sequences originating 
from another antigen of the different stages of 
falcipanmi . 

The invention also covers the polyclonal or 
25 monoclonal antibodies which specifically recognize the 
polypeptide molecules of the invention. 

These molecules of the invention may be used 
for carrying out diagnostic methods and producing kits 
enabling the existence of Po falciparum infection to be 
30 detected; this method can be either an assay of 
circulating specific antibodies, by carrying out 
standard serological methods by bringing one of the 
above antigens into contact with a biological fluid of 
the individual in question, or method© of assay of 
35 antigens using polyclonal or monoclonal antibodies 
obtained by standard methods for obtaining such 
antibodies with the corresponding antigens. In the 
diagnostic outfits or kits of the invention, the 



reagents enabling the antigen/ antibody complexes 
produced to be detected, which can also carry a label 
or be capable of being recognized in their turn by a 
labelled reagent, are present. Depending on whether it 
is desired to carry out an antigen test or a sero- 
logical test, the kit comprises either the antibodies 
or the antigens of the invention. 

The invention also covers all the nucleotide 
sequences coding for a polypeptide of the invention, as 
well as any recombinant nucleic acid containing at 
least one nucleotide sequence of the Invention, 
inserted into a nucleic acid which is heterologous with 
respect to the said nucleotide seqfuence. 

The nucleic acid sequences coding for LSA-S or 
its immunogenic fragments and corresponding to one of 
the following definitions form part of the invention: 

(a) the linked succession of nucleotides as depicted in 
SEQ ID No, 1 of Figure 1, or 

(b) the linked succession of nucleotides depicted in 
SEQ ID No. 2 of Figure 2, 

(c) a linked succession displaying at least 70% 
homology with that of Figure 1 or of Figure 2, or 

(d) a linked succession of nucleotides which are 
coniplementary to those presented in (a) , (b) or (c) . 

The expression "coding for is understood 

to refer both to the gene depicted in SEQ ID No. 1 of 
Figure 1 and the cDNA depicted in SEQ ID No. 2 of 
Figure 2. 

The invention relates more especially to a 
recombinant nucleic acid in which the nucleotide 
sequence of the invention is preceded by a promoter (in 
particular an inducible promoter) , under the control of 
which the transcription of the said sequence is capable 
of being performed, and, where appropriate, followed by 
a sequence coding for transcription termination 
signals. 
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The invention also covers the coding sequence 
originating from the clone T9/96 depicted in Figure 3 
by SEQ ID No. 3, 

In this sequence, the fragment CTl lies between 
5 nucleotides 67 and 126, the fragment 679 begins at 
nucleotide 206 and the fragment 729RE; lies between 
nucleotides 547 and 630. 

Lastly, the invention covers any recombinant 
vector used especially for the cloning of a nucleotide 
10 sequence of the invention, and/ or for the expression of 
the polypeptide encoded by this sequence, and 
characterized in that it contains a recombinant nucleic 
acid as defined above in one of its sites which is not 
essential for its replication* 
15 As an example of an abovementioned vector, 

plasmids, cosmids, phages or viruses may be mentioned. 

As such, the invention relates more especially 
to the plasmid pK 1.2. deposited at the CNCM \inder the 
No, 1-1573* 

20 The BiJibject of the invention is also a method 

for preparing a polypeptide of the invention, by 
transformation of a cell host using a recombinant 
vector of the abovementioned type, followed by the 
culturing of the cell host thus transformed and the 

25 recovery of the polypeptide in the culture medixxm* 

Thus, the invention relates to any cell host 
transformed by a recombinant vector as defined above, 
and comprising the regulatory elements permitting the 
expression of the nucleotide sequence coding for a 

3 0 polypeptide according to the invention. 

The invention likewise covers dna (or rna) 
primers which can be used in the context of the 
synthesis of nucleotide and/or polypeptide sequences of 
the invention, by the PGR (polymerase chain reaction} 

35 technique or any other method known at the present time 
for amplifying nucleic acids, such as LCR, CPR, ERA, 
SPA, NASBA, and the like* 
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The invention relates to any DNA or RNA primer, 
characterized in that it coneiets of approximately 10 
to 25 nucleotides which are identical or con^>lementaxy 
to the first 10 to 25 nucleotides of the nucleotide 
5 sequence coding for a peptide sequence according to the 
invention, or identical to the last 10 to 25 
nucleotides of the said sequence. 

Thus, the present invention also covers a 
method for preparing a polypeptide of the invention 

10 comprising the following steps: 

- where appropriate, the prior amplification by 
standard techniques of the amount of nucleotide 
sequences coding for the said polypeptide using two 
suitably chosen DNA primers, 

15 - the culturing, in a suitable culture medium, of a 
cell host previously transformed by a vector containing 
a nucleic acid according to the invention comprising 
the nucleotide sequence coding for the said 
polypept ide , and 

20 - the recovery from the abovementioned culture medium 
of the polypeptide produced by the said transfoarmed 
cell host. 

By way of example of DNA or RNA primers 
according to the invention, the following pairs of 

25 sequences may be mentioned: 

SI: GTGATGAACTTTTTAATGAATTATTAAA (SEQ ID No. 4) 

S2: TGTTGTTCTTGTTGAACACTTTTTACTAA <SEQ ID No- 5) 

whose respective positions on the IiSA-3/Kl gene 

depicted in Figure 1 are from 695 to 722 and from 82 9 

30 to 799 (reading in the reverse direction) , or the pair: 
6.1: GGTATCGAAACTGAGOAAATAAAGG (SEQ ID No. 6) 
6.2: CATAGCAGGAACATCAACATCCAC (SEQ ID No. 7) 
whose respective positions are 2668 to 2692 for 6.1 and 
3456 to 3433 for 6*2 (reading in the reverse 

35 direction) . 

The peptides of the invention may also 
be prepared by the standard techniques of peptide 
synthesis. This synthesis may be carried out in 
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homogeneous solution or in the solid phase. For 
exanple, use may be made of the technique of synthesis 
in homogeneous solution described by Hovxben-Weyl in the 
work entitled "^Methoden der Organischen Chemie^ 
(Methods in Organic Chemistry) edited by Eo Wunsch, 
vol. 15-1 and II, Thieme^ Stuttgart 1974, or that 
described by R.D. Merrifield in the paper entitled 
^Solid phase peptide synthesis'' (Jo Am. Chem- Soc^ 45, 
2149-21541) , 

The invention also covers the water-soluble 
oligomers of the abovementioned monomeric peptides. 

Oligomer! sat ion can cause an enhancement o£ the 
immunogenicity of the monomeric peptides according to 
the invention o While such numerical information cannot 
be regarded as limiting, it may nevertheless be 
mentioned that these oligomers can, for example, 
contain from 2 to lO monomer units. 

To carry out the oligomerization, use may be 
made of any polymerisation technique commonly used in 
the peptide field, this polymerisation being conducted 
until an oligomer or polymer containing the requisite 
number of monomer motifs for acquiring the desired 
immunogenicity is obtained « 

One method of oligomerization or polymerization 
of the monomer consists in reacting the latter with a 
crosslinking agent such as glutaraldehyde , 

Ua@ may also be made of other oligomerization 
or coupling methods, for example the one employing 
successive couplings of monomer units via their 
carboxy- and amino- terminal functions in the presence 
of homo- or he terobi functional coupling agents » 

The invention also relates to the conjugates 
obtained by covalent coupling of the peptides according 
to the invention (or of the abovementioned oligomers) 
to physiologically acceptable and non-toxic (natural or 
synthetic) carrier molecules that enable, in 
particular, the immunogenicity to be increased, via 
co«5>lementary reactive groups carried, respectively, by 



-li- 
the carrier molecule and the peptide. By way of example 
of macromolecular carrier molecules or supports which 
participate in the constitution of the conjugates 
according to the invention, there may be mentioned 
5 natural proteins such as tetanus toxoid, ovalbumin, 
serum albumins, haemocyanins, tuberculin PPD (PPD: 
purified protein derivative) , and the like. 

By way of synthetic macromolecular supports^ 
there may be mentioned, for example, poly lysines or 
10 poly (DL-alanine) -poly (L- lysine) s. 

By way of hydrocarbon or lipid supports, there 
may be mentioned saturated or unsaturated fatty acida, 
and preferably C^s or Cxh acids of the oleic or 
palmitoleic type. 
15 To synthesize the conjugates according to the 

invention, use may be made of methods which are known 
per se, such as the one described by Frantz and 
Robertson in Infect, and Immunity, 33^, 193-198 (1981), 
or the one described in Applied and Environmental 
20 Microbiology (October 1981), vol. 42, No. 4, 611-614 by 
P,E- Kauffman, using the peptide and the appropriate 
carrier molecule. 

The nucleic acids of the invention may be 
prepared either by a chemical method or by other 
25 methods. 

A suitable method of preparing the nucleic 
acids of the invention containing not more than 
200 nucleotides (or 200 bp in the case of double- 
stranded nucleic acids) comprises the following steps: 
30 - DNA synthesis using the automated p-cyanoethyl- 
phosphoramidite method described in Bioorganic 
Chemistry 4; 274-325 (1986), 

- cloning of the nucleic acids thereby obtained into a 
suitable vector and recovery of the nucleic acid by 
35 hybridization with a suitable probe, 

A chemical method of preparation of nucleic 
acids of length greater than 200 nucleotides has 
already been described in WO 92/13884. 
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The invention also relates to diagnostic kits 
which contain one or more atiplif ication primers 
specific for the LSA-3 gene and which enable the 
presence of the gene or of the mRM^ to be detected in 
an individual likely to be infected by P, falciparum . 

The invention also covers phartnaceutical or 
vaccine compositions in which at least one of the 
products according to the invention is present in 
combination with solid or liquid, pharmaceutical ly 
acceptable excipients suitable for the construction of 
oral, ocular or nasal dosage forms, or ekcipients 
suitable for the construction of dosage forms for 
rectal administration^ or alternatively with gelatinous 
excipients for vaginal administration. It also relates 
to isotonic liquid compositions containing at least one 
of the conjugates according to the invention^ suitable 
for administration to the mucosae, in particular the 
ocular or nasal or pulmonary mucosae » 

Advantageously^ the vaccine compositions 
according to the invention contain, in addition, a 
vehicle such as polyvinylpyrrolidone which facilitates 
the administration of the vaccine. In place of 
polyvinylpyrrolidone <r it is possible to use any other 
type of adjuvant, in the traditional sense which was 
formerly given to this expression , that is to say a 
substance which enables a medicinal product to be 
absorbed more readily or which facilitates its action 
in the body. By way of exan^les of other adjuvants of 
this latter type^ there may also be mentioned 
carboxymethylcellulose, aluminium hydroxides and 
phosphates, saponin or all other adjuvants of this type 
which are well known to a person skilled in the art. 
Lastly^ they contain, if necessary, an immunological 
adjuvant, in particular of the muramyl peptide type. 

The invention also relates to pharmaceutical 
compositions containing as active si^stance at least 
one of the polyclonal or monoclonal antibodies defined 
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above, in combination with a pharmaceutic ally accept- 
able vehicle. 

Additional features of the invention will also 
become apparent in the examples illustrated with the 
5 figures which follow, and show the special features of 
the molecules of the invention relative to other 
antigens of the pre -erythrocytic stage of the parasite. 

Figure 1 depicts the genomic DMA sequence 
ID No. 1 of 6152 base pairs of the LSA-3 gene; it 
10 originates from the clone Kl,2, which itself originates 
from a Thai isolate* 

Figure 2 depicts the cDNA sequence ID No. 2 and 
the polypeptide sequence of the LSA-3 antigen. The DNA 
sequence represents 5361 base pairs, 
15 Figure 3 depicts the sequence ID No. 3 of the 

portion sequenced in the parasite clone T9/96 
(1690 base pairs), the upper line being the nucleotide 
sequence and the lower line the peptide sequence. In 
this clone, the CTl sequence lies between nucleotides 
20 67 and 126, the actual fragment DG67d beginning at 
nucleotide 207. The fragment 729RB lies between nucleo- 
tides 547 and 629, 

Figure 4a depicts diagratnmatically the relative 
positions of the repeat and non-repeat sequences, the 
25 introns and the escons in strains Kl and T9/96, the 
clones 679 and 729 originating from the latter- 
Figure 4b depicts the HCP (hydrophobic cluster 
plot) of the peptide sequence of the clone IKS729, 

Figure 5 depicts the amounts of immunoglobulins 
30 produced in the sexrum of chinqpanzee Nuria before and 
after immunization with different LSA-3 peptides* 

Figure 6 shows the specific antibody titre of 
different species of mice immunized either with a 
peptide or with a corresponding lipopeptide, 
35 Figure 7 shows the inhibition of the sporozoite 

invasion of liver cells by hyperimmune sera obtained 
after immunization with different peptides and immuno- 
purified against whole LSA-3 . 
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Figure 8 depicts the comparison of an antigen 
originating from IjSA-3 with two other antigens with 
respect to type T immunity. 

Figure 9 depicts the induction of interferon-y 
in the chinpanzees Gerda and Dirk with the peptides 
originating from the LSA-3 molecule. 

Exan^le 1; Cloning and sequencing of the LSA-3 

gene 

1} Sequencing 

Initial screening of the gene library 
originating from the parasite clone T9/96 with the 
serum of a missionary treated continuously by 
prophylaxis enabled us to isolate 120 clones corres- 
ponding to molecules expressed at the sporozoite and/or 
iiver stage of the P ■ falciparum cycle - The clone 72 9S 
was used as probe to screen a genomic library of the 
Thai strain Kl already mentioned above, which contains 
large EcoR I fragments cloned into phage lambda gtlO. A 
6.85-kilobase insert containing the whole gene wa^ 
purified from this gene library and recloned into a 
pUC18 plasmid for sequencing and characteTri nation • In 
P , fa 1 c ipa rum , the genome of which is very rich in bases 
A:T(80%), this approach is often rendered difficult by 
the rarity of restriction sites which can be used^ and 
by the instability or even the impossibility of cloning 
certain fragments when they are inserted into plasmid 
vectors - 

The structure of the gene is depicted in 
Figure 4 and displays the following features; 

a) a mlni-exon 1 coding at its 3' end for a hydrophobic 
signal peptide; 

b) a short intron (168 base pairs) included between 
consensus splicing donor and acceptor sites; 

c) a second exon of five kilobases which codes for an 
organized region of 1.8 kilobases, and composed of an 
arrangement of 7 blocks of 4 amino acids and a 3' 
hydrophobic region which might correspond to an 
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anchorage of the glycoeylphoBphat idyl inositol (GPI ) 
type . 

A detailed investigation of the polynK^rphism of 
LSA-3 was carried out by sequencing the clone 679, 
^a^hich contains the bulk of the repeat sequences of the 
LSA-3 gene and a l-kilobase portion of the 3' non- 
repeat fraction, the sequence of this fragment being 
depicted in Figure 3 between nucleotides 207 and 1890 o 

Comparison of the sequences of the clone 679 
originating from Po falciparum clone T9/9S, and of the 
corresponding sequence of LSA-S originating from the 
isolate Kl, shows that the gene is well conserved, the 
most significant differences being observed in the 
repeat region where the blocks of 4 amino acids are 
well conserved but vary in their number and 
organi za t ion . 

In contrast^ the non-repeat 5' and 3' portions 
appear to be especially well conserved, showing up to 
100% homology in the 5" region where B and T epitopes 
have already been identified, 

DMA amplif icationSf in particular by PCR of 
different P. falciparum strains with 8 p^cimer pairs 
distributed over the whole of the IiSA-3 gene^ showed 
that^ except with the ones surrounding the repeat 
regions r the whole of the genome gives PCR products of 
similar size, suggesting that the IjSA-3 antigen is well 
conserved o 

Various LSA-3 probes^ chosen in the repeat and 
non-repeat regions ^ were hybridized at low stringency 
with the DNAs of different species of Plasmodium, and 
did not enable any gene homologous to to be 

identified except in the chimpanzee parasite 
P o reichenowi , confirming the close kinship of this 
species with P , f alciparum o 

Surprisingly, the antigen analogous to LSA-3 
found in P.yoelii ^ which gives clear immunological 
cross -react ions at the surface of the sporozoite with 
antibodies against the fragment 7298, does not appear 
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to be conserved at the level of the nucleotide 
sequence * Lastly, comparison of the LSA*3 sequences 
with the data bases did not reveal any homology with 
known molecules^ except for the repeat region, some of 
the. motifs of which display a strong analogy with the 
repeats of a Staphylococcus xylosis gene> but also with 
two P. falciparum antigens, RESA and Pf 11 .1, which are 
both expressed during the blood stage of the parasite. 
This homology is essentially due to the large amount of 
**Glu-Glu' seq[uences in these antigens and in the 
repeats of LSA-3. 

2} Cloning 

The insert DG729 and other regions of exon 2 of 
the strain Kl were cloned into a prokaryotic expression 
vector pGEX, a vector marketed by the company 
InVitrogen Corp (San Diego USA) * This vector produces a 
fusion protein with the Schistosoma mansoni glutathione 
S* transferase (GST) , and enables the recombinant 
proteins to be purified readily by affinity for 
glutathione -agarose beads. The expression peptides from 
these vectors are designated: 

- for the whole I»SA-3 protein: REC protein, 

- or for the fragment 729S:729PGEX. 

Attenqpts at cloning other fragments, in 
particular the fragment 1-5 3NSREP, 3NFREP, SNR and 
5SNR£P, caused difficulties related either to the 
cloning or to the production and purification of the 
proteins in sufficient amoxants for immxmization 
experiments . 

Only the fragments 729, NN and 3PC enabled 
corresponding recombinant polypeptides to be produced 
and purified in sufficient amounts for analysis of the 
antigenicity of the molecule. 



- 17 - 



Example 2 ; Comparison of the antibody regponaea of 
chimpanzee Nuria before and after immunization with 
different peptides 

Figure 5 depicts the amounts of immunoglobulins 
5 present in the serum of chimpanzee Nuria before and 
after immunization with the peptides 729NR1 and 729RE, 
and the lipopeptides 729NR2 and CTl. 

This experiment shows the superiority as 
regards B immunity of the R antigen, most particularly 
10 when it is conjugated to a lipid residue. 

Figure 6 shows that the level of specific anti- 
bodies measured by ELISA against the peptide 729NR2 in 
mice immunized with either the peptide 729NRII or the 
lipopeptide 729NRII is markedly higher when the lipo- 
15 peptide is used, irrespective of the species of mouse. 

Example 3 : Effects of the antibodies against 
the LSA-3 peptides on the inhibition of the entry of 
aporogoites in mice 

The techniques used to prepare the primary 

20 hepatocyte cultures, the sporozoites, the antibodies 
and the indirect fluorescence test are described in 
detail by S, Mellouk et al.. Bulletin of the World 
Health Organization, 68; 52-59, 1990* The table below 
compares the results obtained in immtinof luorescence, 

25 either with antibodies against the fragment 679 or with 
antibodies obtained against fragments originating from 
other peptides. The left-hand column shows the number 
of schizonts detected after 48 h of culture in hepato- 
cytes of Balb/c mice infected by P-yoelii and the 

3 0 right-hand column the same parameters after infection 
by P.berghei , 
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It is clearly apparent that the antibody 
against the peptide 679 has an almost complete 
5 inhibitory effect on the number of what they [sic] 
observed at 48 h in the liver cells. Likewise, Figure 7 
ahowe the inhibition of the sporozoite invasion of 
liver cells by hyper human [sic] sera obtained after 
immunization with different peptides and immunopurif ied 
10 against whole LSA-3. 

Example 4 ; Cytotoxicity teat against the 
peptide 729NRII in the chimpanzee Gerda 

The chimpanzee Gerda was immunized via the i.v. 
route with the lipopeptide 729NRII originating from the 

15 LSA-3 antigen. Blood is drawn 9 days after the 4th 
injection. The PBMCs were incubated in vitro with 
5 /ig/ml of the peptide 729NRII (addition of recombinant 
IL-2, 10 U/ml, on day 3) . On day 15, the cytotoxic 
activity was studied against autologous blasts 

20 generated with PHA at a concentration of 0.5 fig/ ml . The 
blasts were preincubated overnight with 5 ftg/ml of the 
peptide 729NRII, and with a control peptide, namely 
RESA, or without a peptide. The peptides are not added 



during the test (8 hours) - The number of targets per 
well is 5000. 

PBMCs from Gerda incubated for the same period 
with 5 Atg/nil of a control peptide or the peptide 729NRI 
(originating from the same antigen) do not bring about 
the lysis of autologous blasts preincubated or other- 
wise with the above peptides. 

Figure 8 shpws the results obtained for an B/T 
(effector to target) ratio vaxrying from 12 to 0.03. It 
is seen that the target cells presensitissed with the 
peptide 729NRII are ly&ed in the presence of effector 
cells, indicating a cytotoxic T type immune response 
specific to this antigen. 

The lipopeptide NRII injected via the i.v. 
route is capable, without adjuvant, of inducingf a 
specific cytotoxic response. 

Exanqple 5: Effect of the peptide NRI on 
interferon-Y production 

Interferons have been; shown to have an 
inhibitory activity in the development of P . falciparum 
in. human hepatocytes in culture (Sylvie Mellouk et al . , 
The. Journal of Immunology, vol. 13 9 No. 12; 41-92^ 41- 
95, 1987) . The results obtained with the peptides of 
the invention are as follows: 

The chinpanzee Gerda, immunized with the poly- 
peptide NR2 and boosted with the recombinant D6729r 
carries PBMCs capable of secreting high levels of IFN-y 
in the presence of the LSA-3 peptides, especially the 
peptide 729NRI, The result was confirmed in the 
chimpanzee Dirk, immunized with the same protein. The 
chimpanzee BRAM, an unimmunized control, does not show 
any interferon in the blood against the LSA-3 peptides. 
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The sequences ID n® l to 3 are depicted in figures 
1 to 3 . 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GTGATGAACT TTTTAATGAA TTATTAAA 28 

(2) INFORMATION FOR. SEQ ID NO; 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGTTGTTCTT GTTGAACACT TTTTACTAA 29 
(2) INFORMATION FOR SEQ ID NO: 6: 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
GGTATCGAAA CTGAGGAAAT AAAGG 25 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CATAGCAGGA ACATCAACAT CCAC 24 
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CLAIMS 

1, Polypeptide molecules containing at least 10 

consecutive amino acids of the amino acid sequence 
shown in Figure 2, the following polypeptides being 
excluded: 

- RDELFNELIjNSVDVNGEn^CENILEESQViroDlFNSLVKSVQQEC^H^ 

- VEESVEENDEESVBENVEENVENNDDGSVASSVEESIASSVDESIDSSI^ 
ENVAPTVEEIVAPT\^EIVAPSVVEKCAPSVEESVAPSVEESVAEMLKER 
(729S) 

- RDELFNEIiU^SVDVNGEVKENILEESQVNDDIFNSLVKSVQQEQQH^ 

- DELiFNELLNSVDVNGEVKENILEESQ, (NRI) 

- LEESQVNDDIFSNSLVKSVQQEQQHNV, (NRII) 

- VESVAPSVEESVAPSVEESVAENVESSV- (729RE) 

2. Molecules according to Claltn 1, characterized 
in that they contain at least 20 consecutive amino 
acids of the said sequence. 

3. Molecules according to Claim 2, characterized 
in that they contain at least 50 consecutive amino 
acids of the said sequence. 

4. Polypeptide molecule displaying at least 70% 
homology with one of the molecules of any one of Claims 
1 to 3. 

5. Polypeptide molecule, characterized in that it 
displays at least 70% homology with the following 
sequence : 

Leu Leu Ser Asn lie Glu Glu Pro Lys Glu Asn lie lie Asp 
Asn Leu Leu Asn Asn lie (CTl) , 

6. Polypeptide molecule according to one of 
Claims 1 to 4, characterized in that it displays at 
least 70% homology with the sequence depicted in Figtire 
3. 

7. Immunogenic composition^ characterized in that 
it contains at least one polypeptide molecule according 
to any one of Claims 1 to 6 and at least one 
phamaceutical vehicle. 
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8* Antimalarial vaccine con^osition containing^ 

among other immunogenic principles, a polypeptide 
molecule according to one of Claims 1 to 6 . 

9. Vaccine conqposition according to Claim 8, 
5 characterized in that it contains, in addition, a 

molecule containing at least one epitope and which 
originates from the group consisting of the LSA-l, 
SALSA or STARP molecules. 

10. Composition according to Claim 9, characterized 
10 in that it contains at least two immunogens, the first 

being chosen from the following polypeptides: 

- that of Figure 2, 

- NRI, 

- NRII, 

15 and the second being chosen from the group consisting 
of SALSA, SALSA I and SALSA II. 

11- Polyclonal or monoclonal antibodies which 

specifically recognize the polypeptide molecules 
according to any one of Claims 1 to 6 * 

20 12, Method of in vitro diagnosis of malaria in an 

Individual likely to be infected by P- falciparum , 
which comprises the bringing of a tissue or biological 
fluid taken from an individual into contact with a 
molecule according to one of Claims 1 to 8, under 

25 conditions permitting an immunological reaction, said 
polypeptide molecule and antibodies possibly present in 
the tissue or the biological fluid, and the in vitro 
detection of the antibody gene [sic] complexes possibly 
formed. 

30 13. Method according to Claim 12, characterized in 

that the tissue or biological fluid is brought into 
contact with a mixture of polypeptide molecules 
responding [sic] to one of Claims 1 to 6 and other 
molecules originating from antigens of the sporozoite 

35 stage, namely LSA-l, SALSA or STARP . 

14. Method of in yit_rp diagnosis of malaria in an 

individual likely to be infected by P, falciparum . 
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characterized in that it cotr^^rises the bringing of a 
tissue or biological fluid taken from an individual 
into contact with antibodies according to Claim 11, 
under conditions permitting an inimunological reaction 
5 in vitro between the said antibodies and the proteins 
specific to P . f al ciparum which are possibly present in 
the biological tissue, and the in vitro detection of 
the antigen/antibody complexes possibly formed. 

15. Kit for the in vitro diagnosis of malaria 
10 according to Claim 12 or 13, characterized in that it 

comprises at least one or several molecules according 
to one of Claims 1 to 6, 

the reagents for making up the appropriate medium for 
the reaction, 

15 the reagents enabling the antigen/antibody complexes 
produced by the immunological reaction to be detected, 
it also being possible for these reagents to carry a 
label or to be capable of being recognized in their 
turn by a labelled reagent, more especially in the case 

20 where the abovementioned polypeptide molecule is not 
labelled. 

16. Kit for the in vitro diagnosis of malaria, 
characrerized in that it comprises: 

- antibodies according to Claim 11, 

25 - the reagents for making up the appropriate medium for 
carrying out the immunological reaction, 

- the reagents enabling the ant igen/anti body complexes 
produced by the immunological reaction to be detectd, 
it also being possible for these reagents to carry a 

3 0 label or to be capable of being recognized in their 
turn by a labelled reagent, more especially in the case 
where the abovementioned antibodies are not labelled. 

17 . Use of a polypeptide molecule according to one 
of Claims 1 to 6 in the preparation of an antimalarial 

35 vaccine. 

18. Use of one or more polyclonal or monoclonal 
antibodies according to Claim 11 for the preparation of 
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a medicinal product intended for the treatment of 
malaria. 

19. Pharmaceutical compoaition containing as active 
siibetance one or more polyclonal or monoclonal 

5 antibodies according to Claim 11, in combination with 
an acceptable pharmaceutical vehicle. 

20. Nucleic acid seqiience, characterized by one of 
the following sequences: 

(a) the linked succession of nucleotides as depicted in 
10 SEQ ID No* 1 of Figure 1, or 

(b) the linked succession of nucleotides depicted in 
SEQ ID No. 2 of Figure 2, 

(c) a linked succession displaying at least 70% 
homology with that of Figure 1 or of Figure 2, or 

15 (d) a linked succession of nucleotides which are 
complementary to those presented in (a) , (b) or (c) . 

21. Nucleic acid according to Claim 20, containing 
a sequence coding for a polypeptide molecule according 
to one of Claims 1 to 6 * 

20 22 . Recombinant vector for the cloning of a 

nucleotide sequence according to Claim 20 or Claim 21 
and/or the e^^ression of a polypeptide encoded by the 
abovementioned sequence, containing the said sequence 
in one of the sites which is not essential for its 

25 replication, the said vector being, in particular, of 
the plasmid, cosmid or phage type. 

23. Vector according to Claim 22, characterized in 

that it is a plasmid deposited at the CNCM under the 
No. 1-1573 and referenced pKl,2- 

30 



- 25 - 



ABSTRACT 

Polypeptide molecules containing at least 10 
consecutive amino acids of the amino acid sequence 
5 shown in figure 2, representing the LSA3 antigen, the 
following peptides being excluded: 

- RDELFtreiiliNSVDWGEVKENILEESQVNDDIPNSLVKSVQQEC^ 

- VEES VEENDEE S VEENVEENVENNDDGSVAS SVEES IAS S VDES I DS S I E - 
ENVAPTVEEI VAPTVEE I VAPS WEKCAPSVEESVAPSVEESVAEMLKER 

10 (729S) 

- MJELFNELLNSVDVNGEVKENILEESQVNDDIFNSIiVKSVQQEQQHN 

- DELFNELLNSVDVNGEVKENILEESQ, (NRI) 

- liEESQVNDDIFSNSLVKSVQQEQQHNV, (NRI I) 

• VESVAPSVEESVAPSVEESVAENVESSV, (729RE) 
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(2) INFORMATION FOR SBQ ID NO; 1: 
(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61S2 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 1: 

ATTWTTTAT TtTTATTCTT TTATTTCTtT TTTTTCTTTA AATICTAIAT TTATAAATAT «0 

TRAAAAACT lACAAAAJCA CAAATAOXAA TTACAAATCA AATAATAAAA CATATAATGA . 120 

AAATAAXAAT C A AttAAAIAA CTA CXUT AT T XAAIACAACA AAXAX6aAS€ OGAZAAAAAA ISO 

AICTCAXATC ACAOAAAAAA tAAAtAACTA CTTTTRTtC ATCAAAAZIT. IGAiiSAXGCAC 2*0 

CATTRAATA TOCfCCICTAC AAXATGAtAA TAAOCXAAOA tAAAAAACXA AAXAAZAAAT 300 

aiAaataaaa AAAAAAAAAA AAAAAAAAAA AJOU^CTATA :T>gr^-yCTAT aatatatata 3sp 

TAZATAtATA tAXATATATA lATATAlAlA TAmATTTt lATnAnTA. tXA AAAJ^Hl 410 

rtrhTtATA TlAli^lllll ACttlCAlAT AAACAMBAGT tOSAAMAAA. AIACCTATGT 480 

ACAIAPMBAAA TtCAAXAAAC. XATTIAA^ MO 

AtXACCtACT CMAGAAeXAA AMMAAAAAT . TCTTGACTXA TTAfiAACAAC GAAAtACAXT 600 

AACICAAACT.CtAeAiCATA AtAAAAAXTT AflAAfeAACCC GAACAlAlAA A<gCAAAA'fttt MO 

CTtAnAACT AATATACAAC AACCAAAAOA .AAATATIAtT GACAAriTAT TAAAlAAlSAt 720 

• . . ' ' * • 

TOGiACAAAAT TCAGAAAAAC AAGAAAfitCT ATCAiSAAAAT CtACAACTCA CTCAtCAACT 780 

TXTXAATCAA TZAtTAAATA ClUlACATCt TAATCCAGAA CTAAAACAAA ATAlTTTOGA 840 

OCAAACTGAA CTtAAXGAOC ATATTTTTAA TACTTTACtA AAAAGTCTTC AACAACAACA 900 

ACAACAiCAAT CnrCAAfiAAA AACITCAACA . AACTCTACAA CAAAATCACC AACAAACTCT 960 

AGAAGAAAAT CTA C AACAAA ATCTACAACA AAATCACCAC CCAACTCTAG CCTCAACTCT 1020 

TGAACAAACT ATACCTTCAA OIC IT CA TCA AACTATACAT TCAACtATTC AACAAAATCT 1080 

ACCrCCAACT CTTGAACAAA TCCTACCtCC AACTCTTCTA CAAACTCTCC CTCCAACTCT 11*0 
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TGAACAAAGT GTAGAAfiAAA ATCItCAACA AACTCTAGCT CAAAATCTTC AACAAACTCT 1200 

AGCTGAAAAT CT7CAACAAA CTGTACCTCA AAATCTTGAA GAAACTCTAC CTCAAAATCT 1260 

TGAACAAATC CtACCTCCAA CTCTTCMCA AA.TCG1ACCT CCAACTCTrtC AACAAATTCT 1320 

AOCTCCAACT GTTOTAGAM CTCTGGCTOC AACTOTTGAA GAAACTCTAjB AACAAAATCT 13«0 

TCAACAAAGT etAOCTCAAA AtCTTISAAGA AAmtACCT GAAAATCTTC AAGAAAGTGT 1A40 

ACCTciiAAAl OTXGAAGAAA GTCTACCTGA AAATCTTQWl OMACTGTAiC CtttMAATCT 1500 

TCAACAAATC CTAOCT CCA A CICTTCAACSA AATOCTACCT CCAACICITG AACAAAlTCI 1560 

AGCTCCAAGT CTFRAGAAA 01CTGGCTCC AACKTTCAA GAAAGtCTAC AA6AAAATCT 1620 

ICAACAAACT ClACCtGAAA AICTTCAAOA AACXCtAGCT GAAAAtCTTG AACAAACTGT 1«80 

ACCTGAAAAT CTTGAACJUUk CIGTACCIGA AAA3CTTCAA GAAACTCTXAfi CtWiMMGt 17*0 

. TCAACAAACT CXAOCTQAAA AXCTTCMCA AACTCtMBCT GAAAAXCrtG AACMAXOCT l»00 

ACCTCCAACT CtrCMCAAA TOGlACCTCe AACttQTXCAA GAAATlCXAG CICCAACTCT 1860 

TOtACAAACT CTOOCTCCAA CTCTtCAACA AACTCXACAA OAAAATCTTC AACMACICT 1920 

AGCTGAAAAT CTTCMCAAA CTCTADCICA AAATOITeAA GAAAGT6IAC CtCAAAATCT 19S0 

tCAAOAAAGT 6TACCICAAA AXCtTCAACA AATOCTACCT CCiM^CTTBTrC AACAAAIOCT 2O40 

ACCtOCAACT cnCAAjBAAA TTCZACCTCC AACTCTtCTA CMAA|CT^TSC CTCCAACTCT 2100 

TGAAISAAACT CTAGMCAAA AXCTiCAACA AACTCTACCT GAAAATCTTC AACAAACTCr 2l«0 

AGCliipAAAAT CitCAACAAA CTCZACCTGA AAAliCTTCAA CMATCCTAC , CrcdwCXCT 2220 

TCAACAAATC CZACCTCCAA CTGTICAAGA AATTCTACCT CCAACtCTIC TACAAACTCT 22W 

CCCTOCAACt CTTGAAGAAA CTCtACAACA AAATCTTCAA GAAAGTCTAfil CtCAAAATCT 2340 

TCAACAAACT CTACCICaAA ATCTICAACA AACTCTAGCT GAAAATCTTC AAGAAACTCT 2*00 

ACCTGAAAAT CTIGAACAAA TCCTACCTCC AACTCTTCAA CAAATCCtAC ClCQAACTCT 2460 

tCAACAAATT CTACCTCCAA CTCiTCTACA AACTCTCCCT CCAACTCTTC AAGAAACTCT 2520 

AGAAGAAAAT CTTCAACAAA CTCTACCTCA AAATCTtC^ CAAACTCTAC CtOAAAATQT 2580 

TGAACAAAGT CTACCMiCAAA ATCTTCAACA AACTCTAGCT CCAACTCnC AACAAATTCT 26*0 

ACCTCCAACT CTTCAACAAA CTCTACCTCC AACTCTTCAA CAAACTCTTC CTOAAAACCT 2700 

TCCAACAAAT TTATCAGACA ATCTTTTAAC TAAtTTATTA CCTCGTATCC AAACTCAGGA 2760 

FZOURB X/B 
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*AT««6«C *CTAt*IT*» A««Ut*e» ««CT«A» «««WT*« IC««»C««T 
ACTAOAAAAC CIACAACAAA CntCACCTOA AACIBTAAeT ACITIT.OTA ACATAWABA „io 
eC»6A«CAA CAAAAtACtt ttACTAAIBA TAC«I»a« «»AAA™c AACAACt^ i„0 

«*«**Tc.* TTAACiceoc cimoAAAA «eec^ 

AAT«««IA *t«»A«««C t»AA««*6A COIOPCIAOC AClmAIAB AAACICTCe* 
««»C«*A CAAAAOAeOB CAAAtACAAt TA«^ 

"C»«A«A ACIAAICAAA *»««AC« ^^^^ 
TAAT.C«« ™eA,««, ^^^^^ ^ 

•-AWAAA™ «A«AA.CAT TTmACWA AAtAmCAT AATCttAAAC CAATACAACA 

*«^»*ca^ 
•*«*n«r .mAAWAAA AKteemc tt.^ 

«*«Cm» IWAA^ 1««AA«A, T««eiACT CAAGCICTIC AACAAACKT 3AM 
«eWMC»T CnCAACAAA AtOtAT.** eBATettWT e»CCn»IA nAAACAICA ,«o 
««™<»A *^»AA» A«C«CAOC WtCMAOiu A»»m,<A AtTtCSAAGA ««, 
»«»«TI»AA ACTBAAACTC OetMTl^e TCTAfJAAttA ATOAfittTC AACeOClicA 
^***C»COTA «AAA»C«A Cm»C»T «AtT«A«A .««.i»AA A»Wct«« 
I«Att»«» C«A«««A»C «»«Tm*e «CA«Aaic AIAdAlCCAC lisAAeAMB 3m 

«»XM*A«» «TK«»^ en^^ 

TOTO-SA cmttBMs AAeneAAei caar«iatg wnAAAGre ite«AA«a 3k» 

THAOAA™ M*A«TA«» AA««ACtT A^ CiWTCAAA T*»*W»<»t 

TACtA«.AA CIT«TO«5 ««,re»A<ai CTTAAA^^ »^ 
ASATATOSAA A*AT,AAAAC .A*««««AA ACCAIIaW 

CATCAIACAT TACMAAACT TATTPAApi* CAACAWA^. T«A<W«i«: 4,A, 

On«»yTS»A CUCTAttAt TAAAAOATOT aAA««ac AAOAICCAAA AACTATCTCA »», 

TTTAAAACAt eiTB»A«*e ATATATTAAA AOAAWaAA* CAAATCAAAC AACTOAAAC AIM 

«**AITrt« CAACAITAI* AA«A7T»AA AACTATOAA AC«ATA*TT TAOAACACAA «3«, 

«*«=MATA CAAAAACATC ATTTTSAAAA ATTCCMOAA «AA«Cl«AC AAATAAAACA *„0 

FXOUfUS 1/C . 
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TCnCAACCA GAtATATTAA AAOAACtATC TTCATTACAA CTTCAACAAC AAAAAAAATT 

ACAAfiAACTA CACGAATtAA AAGAAGACCT AGAACATATA ATAACtCClt; ATCCCCATAT 65O0 

AAAACGnrO CAAGAACATG ATTTAGAACA ACTACATGAT TtAAAACGAA «TaTATTACA *560 

CATGtTAAAC OCAGATATpC AATEAODCGA TATOGATAAC CAAACtTXAC AAGATCIAAC 4620 

AAGAAAACrr^CAGAAACAC TTGAAtCCtT AAAAGAlVtT TTATCtAC^ qAXZACpCAT «680 

GGATttAA^ CAAATGAAAA- CAAfiAAAAAA AISCTCAAACA ^CCtAACTIOG AAtiMCtAlT 4740 

ATTAAAACAA. GACCTTAAAG A A CAACCAAA tiAAAAAAATA ACAAAAAAOA AAlHiAACCIT 4800 

TCAtATTAAC CAl^AOGAAC CAAAAGATCA AATACTAGAA CTTGAAATOA AA6AXGAAGA 4860 

TATACAACAA CATGTAGAA6 AAQATAtACA ACAAGATA1A GAACAACAXA AAfitlQAAGA 4.920 

XATAGATeAA GAXAIACATC AACATATAQC TCAAO^^ 4960 

ACtCCAAAAA CACAAACCCA tTCAAAACCT T A AAftTB^aii AAft^^^^^^T T ft ffft ftA ftAflA 5040 

MfrtiBMKk GQtCX^^ GKTUAMA ACACCIACAC CAACTAATGA AA3AIGITCA 5100 

AAAAATtCAT AAAGAACttiB ATAAJWCAACT AlCZAAACqt ITlACAAiCAA AAMXOASVT 5160 

TACtAATCri TTAAAACAAA AtCAAGATtT rtTTiiCMh <fnAkMCr ICCIAAAAAA 5220 . 

ATAXAAACtA 1TTGCTCCAC CAnCAtATC lUAA-l lltU . OCATncCAT CAlAaCTAiBT 5280 

TCCCilC'm ACATttTCtT. YATmCATC ATCTGIAACA ATAeCTTCTt CAACTlACXT 5540 

ATtATCAAAA vCtlGACAA^ G^ATAM'i^ AAAXAACGAiI^ AGACOCTTXT ATT^TTtCT 5400 

ATITCAIATC TTIAACAATT TAAAACAn^ Sk£0. 

AGAAAAAAAT AA3^tGtAA T^^^ 5520 

OCTAACAAAT . AAAACCCAAA AAACAACTAA ACTTQAtAAA AATAATAAAC TACOGAAJIAA 5580 

AAGAACAAW CAAAAATCAA AAIAAAAAAT iX»^^ TCAAAICATT OGACOauiCA 5^0 

ATAAAATTAA TCQATAAAAA ATATAAAAAJ CTATATAttA tCTAAATATA TATAAATAAA 5700 

''AAATAAATA CATaCATATA TATATATATA TATATCTATC TTTTtACAAA Al^ 5740 

. TITAAAATTT ATATATATtA ATATTTATAT TTTTCCATAT ATAATrTTAT TTTCAATATT 5820 

TTATTITTAA TTATAAATCT TrrtTACACA CTTTATCtTT TITAATTAAT ATATACATTT S8S0 

CtCTAAGAAA CTCTATATTA rrC^TACGAT ATATCTiUlTA TrAAtTATTT Gli.iiHA TT 5940 

AAAAITTATA TTATATAATA TATATATATA TATATaTCTA TATATATTAC AAGATAAAAA 6000 

FIGOIIB 1A> 
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TrXACCTTAT TTTCCTTCIT ATCCAAATAA CCTTTTTTTT UiiliHH iiiHAAiiC 60W 

ATAtAAACGA TCTTTAATTT TXAAlllllA ATATTTTATA TAAAATATTT TTCCTAAAAA 6120 

AAAAAAAAAT tAAAAAAAAC TlAtAXTTCG AA 6152 



FZOOBS 1/B 
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(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5361 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNBSS : doxible 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..5361 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



ATG ACA. AAT ACT AAT TAC AAA TGA AAX AAT AAA A£A lAT AAT OAA AAT ' AS 
ifBC Thr ABn S«r Am Tjrr igrs 8«r Avh A«n Igr* Thr IVr A«n Clu Assi 

• 1 . • . . . • 5. • • • ■ ■ • 10 . ■ - • • • IS-- 

AAT AAt OAA CAA AEA ACT AOC ATA TTT AAT ACA ACA AAS: AXG AAT COC 96 
Asn Asn Civ Cln. Xli^ .Ttar Ihr 11* Vtwi A«ii Arg Thr Asfk ltee. Asn Pvo 

• • -26 •. . -..25 3p-'. 

AIA AAA AAA TOT CAT Aie ACA CAA AAA A1A AAT AAC TAG TTT TTT TTC M 
IK. LyA Ly« Cya Hi* itoc Arg Clii tys IK JUpn lys T>v Am Pti* Lmi 

.35 •. AO .. • AS ■ • ■ 

ATC AAA AIT TTC ACA TOC ACC Aft TTA ATA TGC GCT.CTtA CAA TAT GAT 192 
II* Lys tl» l«u thr Cys. Tbr XI* l»u il« Tcp Ala Vrnl cin TVr Am 

■. ■ 50 55. ,.60. 

AAT AAC TCT 43AT ATA AAC AAJB ACT TCC AAA AAA AAT ACC TAT CTA GAT 240 
Abd Am Smr Asp Ila Asn Lys S«r .Trp Lya Lys a«ii Thr TVr V«l A»p 

«ai 70 ■ .. . • • . to 

AAC AAA TIC AAT AAA CTA TTT AAC AfiA ACT TXA CCA OAA TCT CAA CTA 288 
Lys Lya Law Km Lys Lau Fh» Asn Arc Smv Uu Cly Clu Ser Clh V*l 

' ' . ■ W> . M. •■ ■ 

AAT OCT CAA TTA CCT ACt CAA CAA CTA AAC CAA AAA ATT CTT GAC TTA 336 
Asa Cly Cltt Lni Ala Scr Olu Clu Val Lys Clu Lys .lie Ucu A«p Lau 
.100 105 110 

TTA CAA CAA . OCA. AAT ACA TTA ACT CAA ACT CTA CAT CAT AAT AAA AAT - 384 
Lau Clu Clu Cly Aan Thr Lau Thr Clu Sar V«l Asp Asp Asn Lys Asn 
lii .120 125 
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TIA CAA CAA CCC CAA GAT ATA AAC CAA AAT ATC TO TO ACT AAT ATA 
Su Cl« Clu Al. Clu A.P 11. Ly. Clu A,n llm t«u S« A-« 11» 

130 . 

OAA lUA CCA AAA CAA AAT ATI ATT CAC AAT ™ i S 
Si Slu Pro ly. Cl« A«« n* !!• A.p Aw» l*u L.« A«i M l\* 

IAS . .150 

CyUi AAT TCA GAA AAA CAA GAA ACT CTA TCA ttJA AAT 2^2}^ ^ ^ 
Aift S«r ciw Ly. Cln CU S*r v*l J« «1« ^ 61» JjJ 



CAT CAA CXT ITT AA* CAA TTA TXft AAT ACT CIA CAT W gA CA^ 

A« Clu 1«» >h« A«i Clu l*u L.U A«» S«r V*l A.p ^^l M Cly Clu 

. • . ISO'. • . 7^ • 

CTA AAA 4:aA iAT ATT TTC CAB CAA ACT CAA CTT AAT CAC CMT ATT TTt 
Si S« Clu A« II. 1-u OXU Clu 8« Cl« V*l Am A-p 11. fH- 

•195- • ■ •200:'.. . .■. 

AAT ACT TtA CTA AAA ACT OTT CAA CAA CAA CAA CAA CAC *** Jf^ 
iSi ™ Ly. »*r V.1 Cln Cln Clu Cln Clli HI. A«l V»l Cl» 

CAA AAA CTT CAA CAA ACT CTA CAA CAA AAT CAC » «AA A^T .OTA «AA 
Su tj. V«l Clu Clu S.T V.l Clu Clu AS. A^ Clu Clu £«c Vpl Clu 

CAA AAT CTA CAA CAA AAT CtA CAA CAA AAT CAC CAC OCA ^ 
S« aS ^ ClS Oiu Am V.1 Clu Clu An A«F MP C\y imT JjJ Al. 

TCA ACT GTI CAA ClM^ ACT ATA CCT ^ ^ 

S« $*r Vrt Cl« «lu II* AU S.r S.r V.1 A.* Clu II. A^ 

'.. 260'. •.■2W- 

TCA ACT ATt CAA CAA AAT CTA CCI CCA ACT CTT cijA CAA CTA WT 
Smv S.r 11^ Clu Clu A«n V.l Al. Fro Thr V.l Clt» Clw W v.i a*. 

.•■ 275 ■ • • 2BO • • *•* 

eCA ACT CTT CTA CAA ACT CiC CCA JCT PlT CAA CAA ^ SI 
Pro S«r Vol Vol Clu S*r V.l Al. fro S.r V.l Clu Clu S.r Vol OXU 

. .. 290 •• ■ • .' •• ^ : • : 

CAA AAT OTT CAA CAA ACT OTA CCT CAA AAT CtT CA^ CAA ^ «^ 
S« vS Clu Clu S-r Vol Al. Clu A.« V.l Clu Clu S.r Vol Al. 
305 310. 31* . 

CAA AAT CTT CAA CAA ACT CTA CCT CAA AAT CTT CAA GJA APT CTA CJT 
Clu A«h V.l Clu Clii S.r V.l Al. Clu Asn V«l Clu Clu Sac V.l Al. 
325 330 
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CAA AAT Cn GAA GAA ATC CTA CCT CCA ACT CTT CAA GAA ATC CTA CCT 10S« 
Clu ASA V«l Clii Ciu 11« Val Ala Pro thr Vil Clu Clu XI* V«X aIa 
3A0 3*5 350 

CCA ACT CTT GAA GAA ATT CTA CCT CCA AlBT CTT CTA CAA ACT CTC CCT X104 
Pro Thr. VAl Clu cXu Ilo y»l aX« Pro S«r V«l V»X CXu Sor V«X AXa 
3S* 3«0 365 

CCA ACT CTT QAA GAA ACT CIA GAA CAA AAT CTT GAA GAA ACT CTA CCT 1152 
Pro Sor V«X Clu Clu 5«r VaX Clu Clu Am V«l Clu Clu Sor Val Ala 

.370 375 

^ i^'' 2^ ^ ACT CIA CCT CAA AAT CTT GAA GAA ACT CTA CCT 1200 

Clu Amn Val Clu Clu Sor V«l Ala Clu Aan V«l du Clu Sor Val Ala 
385 390 395 400 

GAA AAT m GAA GAA ACT CIA CCT GAA AAT CTT GAA GAA ACT CiA CCT 1248 
Clu Amu Val Glu t Gilt Sor Val Ala Clu A«n Val Clu Clu Sor Val Ala 

: . 410 415 

GAA AAT GIT GAA GAA ATC CTA GCt CCA ACT CTT GAA GAA ATC CTA OCT 
Clu Asn Val Clu Clu val Ala Pr^ Thr Val Cli» Clu Xlm Val Ala 

- ' ' ■ «'30. 

?^ 21? ?f S?* GAA ATT CXA CCT CX:a ACT CXT GIA GAA ACT .GIG CCT 
Pro nkr, V«l Glu Clu Ilo Val Ala Pro Sor Val Val Gin $or Val Ala 

■ . ■■**C ■. " .• ■■ 445 

ACT CTT GAA. GAA ACT GIA GAA GAA AAT CTT GAA CAA ACT CTA GCT 
Pro So* Val Glu Glu Sor Val du clu am Val Clu Clu Sor Val Ala 

•.*3C .■ .• ■ • 455' . 



1296 



1344 



1392 



^ ^ ^ ^ 9^ !BCT MX CTT CAA CAA ACT CTA CCT 1440 
Clu Am Val Clu Glu Sor Val Ala Clu Aan Val Clu. clu Sor Val Ala 
••3 . .470 475 4S0 

CAA AAT CTT GAA GAA ACT CTA CCT CAA AAT CTT GAA CAA ACT CIA CCT 14BS 
Clu Aan Val Clu Clu Sor Val Ala Clu Ami Val Clu Clu Sor Val Ala 
. • ,490- :495 

GAA AAT CTT GAA GAA ACT CTA CCT GAA MT CTT G^ 1536 
Clu Afu Val Clu Clu Sor Vol Ala Clu Aan Val Clu Glu Sor Val Ala 

■ 500 505 . 5X0 ■ ■ ■ 

^ CTT G44 GAA ATC CTA CCT CCA ACT CTT GAA GAA ATC CTA CCT 1584 

Clu Aan Val Clu Clu Ilo Val Ala Pre Ihr Vol Clu Clu Ilo Vol Ala 

• 515 • • • 520' • ■ 525 

CCA ACT CTT GAA GAA ATT CTA CCT CCA ACT CTT CTA GAA ACT GIG CCT 1632 
Pro Thr Val Clu Clu Ilo val Ala Pro Ser Val Val Clu Sor Val Ala 
530 535 540 
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COk. ACT CTT GAA GAA ACT CtA .OAA GAA AAT CTT GAA GAA ACT CTA CCT 1680 
Fre'S*r Vml Clu Glu S«r VaI Clu Clu Asn Val Clu Glu S«r AIa 

545 550 . 560 

CAA MT CTT GAA GAA ACT CIA CCT CAA AAT CTT GAA GAA ACT CTA CCT 1728 
Clu Asa Val Clu Cltt 5»r V«l1 Al* Clu Asn Val Clu Clu S%t Vml Aim 

565 . 570 . 575 

GAA AAT Ctt iGAA GAA ACT CTA CCT GAA AAT CTT GAA GAA USSC GXA GCT ' 1776 
Glu Asn Val Clu Clu Smx Val Ala Clu Ash Val Clu Glu Ila Val Ala . 

• 5B0 -. • ■■ ■ •. 585 • ■ ^90 

CCA ACT OTT CAA CM. ATC CTA CCT CCA ACT CTT GAA GAA ATT CXA CCT 1824 
Px« Tta VAl Clu Glu lia Val Ala Thr Val Clu Clu 11* Val Ala 
5?5 600 605 

OCA ACT CTT dCA GAA ACT GIG GCT CCA ACT CTT CUA GAA ACT CTA CAA 1872 
Pr« -Sar iVal.Val Clu Sar ¥al Ala Pro Sar Val Clu Clu Sar Val Clu 

610 ^ 615- 620 - ■ .■ • 

GAA AAT CTT GAA GAA ACT Gxi CCT CAA AAT CTT GAA GAA ACT OTA CCT 1920 
Glu A«u Val Clu Cltt $ar Val tCU Clu Asn Val Cltt Glu Sar Val ^n 

6.J5- ;•■ ■ 6'50-. ••635 --. y.- ••-640." 

GAA AAT CTT GAA GAA ACT CTA eCT GAA AAT err GA^ 1968 
Glu Asn VAl Glii Glii 5ar Val Ala Clu Asn Val Clu ClU Xla Val Ala . 

.■ ■ 645 . ■ .650 . • -.655 , • 

CCA ACT CTT GAA CSAA ATC CTA CCT CCA ACT CTT GAA GAA ATT CiZA CCT 2016 
Pro Thr Val Cltt Glu Xla Val Ala. Ps* Thr Val Glu Clti ila Val Ala 

■'■ • ■ ■• .'^^o • 

CCA ACT CfT CTA GAA ACT CTC CCT CrCA ACT CTT GAA GAA ACT OTA GAA 2Q64 
Pro Sar Val Val Glu Sar Val Ala Pro Sar Val Clu Clu S*r Val Glu 

. ...675- •.• 680 : . . 

GAA AAT CtT GAA CAA ACT CtA GCT GAA AAT GTt GAA CAA AGT CTA COT 2112 
Glu Astt Val Clii Clu Sar Val Ala Clu Aim Val Clu Clu Sar Val Ala 

690.-., *'5 ■. . • . 700.' ■■ 

GAA AAT CTt CAA GAA AGT CTA OCT GAA AAT OTT GAA - GAA ACT CTA CCT 2160 
dv Asn Val Clu Glu Sar Vsl Ala Clu Asn VaI Clu Clu Sar Val Ala 
70S 710 715 720 

GAA AAT CTT GAA GAA ATC CTA CCT COi ACT iSTT CAA CAA ATC GTA GOT 2208 
Clu Asa. Val ciu Clu Ila Val Ala Pro Thr Val Clu Clu Ila Val Ala 

• ■ . ■ 724 ;.- . 7.30 ... 

CCA ACT CTT CAA GAA ATT GTA CCT CCA ACT CTT GTA CAA ACT tnC CCT 2256 
Pro Thr Val Clu Clu 11a Val Ala Pro Sar Val Val Clu Sar Val Ala 

740 . 745 750 
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CCA ACT CTT CAA CAA ACT CTA CAA CAA AAT GTT CAA CAA ACT GTA CCT 2304 
Pre Ser V*l Clu Clu S«r V«l. Clu Clu Asn Val Glu CLu 8«r V«l Ala 

75$ . 7iO 765 

CAA AAT err GAA CAA ACT GTA CCT CAA AAT CTT CAA CAA ACT CTA CCT 2352 
Clu A*n V«l Clu Olu S«r Val Ala Clu Asn Val Clu Clu S*r V«l Ala 
770 775 780 

CAA AAT err GAA GAA AST CTA CCT CCA ACT CTT GAA CAA ATT CIA CCT 2400 
Clu 4ni Val Clu Clu 8ar Val Al* Fro Thr Val Clu Clu Il# Val Ala 

785 . . 790 7« 800 

CCA ACT CTT GAA GAA ACT GTA CCT CCA ACT GTT GAA GAA ACT CTT GCT 2448 
r»e Sar Val Clu Glu Sar Val Ala Pre Sar Val Clu Glu Smt Val Ala 
«05 810 815 

GAA AAC CTT CCA ACA AAT. TTA tCA CAC AAT CTT TJA ACT AAT T1A TZA' 2498 
Glu Awn Val Ala Tbr Asn Lau Sair Asp Aan Lau l«u Sar Aaa Lau Imu ■ 
820- 825' 850. 

CCT CCT ATC CAA ACT CAC GAA. ATA AAP GAC ACT AXA TTA AAT CAC ATA 2544 
Cly Cly. 11a Glu Tht Clu Clu lie lors Aap Sar Ila Xjmi Asn Clu Xla 

■'835 840' .• - ■• . • -845 

GAA CAA. GTA AAA GAA AAT CZA CIC AOC ACA ATA CIA GAA AAC^CTA CAA 2592 
Glu Clu Val Lya Clu Asn Val Val Tbr Tbr 11a hmt Clu asp Val Clu 

■850 ■■ ■ 855 •: •. • . 860 ' 

GAA ACT ACA OCT . GAA ACT CTA ACT AC? TIT ACT AAC ATA TTA CAC CAC 2640 
Glu Thr Tfar ALm Clu Sar V«l Thr Thr Pha Sar Asa Xl» tau Clu Clu 

•«45 •■• •••70.- .'■875.. 880. ' ' 

AtA CAA CAA AAT ACT ATT ACT AAT GAT ACT AlA GAib GAA AAA TTA GAA 2688 
lie Cln Clu Asn Thr lie Thr Asn Asp tbr lie Glu Clu Lys Laii Clu ' . 

- 885 890- • 895 ... ■ 

CAA CTC CAC GAA .AAT GTA TTA ACT CCC GCT TTA GAA AAT ACC CAA ACT 2736 
Glu Laii His Clu Asn Val Lau. Sar- Ala Ala 1m Olu Asn Thr Cln' Ser 

9O0 ■•.905 . 910' • •■• 

GAA GAG CAA AAC AAA GAA CTA ATA CAT CTA ATT CAA CAA CTA AAA CAA 2784 
Clu Clu .Clu Lys Ly* Clu Val lie Asp Val He Cltii Clu Val l:ys Clu 

.' ■•^^ • • ■ W ■ 

GAC CTC CCT ACC ACT TTA ATA CAA ACT CTC GAA CAC CCA CAA CAA AAC 2832 
Clu val Ala Tlir Thr Lau lie Clu thr Val Clu Gin Ala Clu Clu Lys 
930 935 940 

ACC OCA AAT ACA AtT ACC CAA ATA TTT CAA AAT TTA CAA GAA AAT CCA 2880 
Ser Ala Asn Thr He Thr Glu lie Ph« Clu Asn Leu Clu Clu Aan Ala 
^45 950 955 960 
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CTA GAA ACT AAT GAA AAT CTT CCA CAC AAT TTA CAC AAA TTA AAC GAA 
Val CIu Ser Aan Clu Aati Val Ala Clu Asn Leu Clu Lys L»u Asn Glu 
W 970 975 

ACT CXA TTT AAT ACT CTA TTA OAT AAA CTA CAC GAA ACA CTA GAA ATT 
Hur VmI Ph« Asn Ihr Val Lau Asp hym Val Clu Glu Tkix Val Clu Ila 
. 9B0 985 990 

ACC OCA GAA ACT TIA CAA AAC AAT GAA ATC CAT AAA CCA TIT TTT ACT 
Sar Cly Clu Sar Uu Clu Asn Aw Clu Mae Asp Lys Ala Pha Pha 8«r 
?W ipdO 1005 

S^* CTA AAA COA ATA CAA GAA AAT TZA TZA ACA OCT 

Ji?*"** '^'^ As« 1*" tau Thr Cly 

1010 . . 1015 1020 

ATC TTT CCA ACT ATA GAA ACC ACT ATA CTA ATC CAA TCA CAA CAA AAC 
Mat Iba Ars Ssr-.Ila Clu Ihr Sar Ila Val Ila Cln Sar Clu Clu Lvs 
X02S 1030 1035 1040 

C1T:6AT TTC AAt CAA AAT CtO CTT ACT TOC AR TTA iSAT AAT ATA GAA 
Val Aap Lau Asn Clu Aan Val Val Sar Sar Ua Uu Aap Asa ll* Clu 
1045 1050 1055 

AAT ATC AAA GAA OCT TTA TTA AAT AAA TTA GAA AAT AJTT TCA.ACT ACT 
Asn Mac Lys Glu Cly Lau Lau Aan lys Lau Clu Aan Ila Sar Sar Thr 

IWO 1065 1070. 

5J* 2* ^ ^ CAA ACT OtA ACT CftA CAT CtA AAA CAA AAT CTA TAT 
Clu Cly Val Gin clu Thr V^ Thr du Hia Val Clu Cla Asn Val Tyr 

. ioao ipas 

f*^ 2IT OCT ATC AAA GAT CAA TtT TJUi OCA AM TTA 

*^ tJILT'^ ^ Vy» A»l» Cin Pha Lau Cly Ila Uu 

1095 . : llOO 

A^ CAC CCA CCA OGC Tie AAA CAA ATC TTT TTT AAT TTC CAA CAT CTA 
M Clu Ala Cly Cly Lau Lys Clu Mat Pha phs Aan Uu Clu Aap Val 

1105 luo 1115' 1120 

TTT AAA ACT GAA ACT CAT CTA ATT ACT CTA CAA GAA ATT AAC CAT GAA 
Pha Lys Sar Clu Sar Asp Val Ila Thr Val Clu Clu Zl« Lya Asp Clu 

•1125 • •.. 1130 ' ■ . .• U35. 

pec CTT CAA AAA GAC CTA CAA AAA CAA ACT CTT ACt AtT ATT CAA CAA 
Pro Val Cln .Lys Clu Val Clu Lys Clu Thr Val Sar Il« il» Clu Clu 
11*0 IMS 1150 

ATC CAA CAA AAT ATT CTA CAT CTA TTA CAC CAA CAA AAA CAA CAT TTA 
Mae Clu Clu Asn Ila Val Asp Val Lsu Clu Clu Clu Lys Clu Asp Lau 
1155 1160 1165 
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ACA OAC AAC ATC AXA GAT CCA GTA CAA GAA TCC ATA GAA ATA TCT TCA 35S2 
Thr Asp Lya Mac IXm Asp Ala VaI Clu Clu S«r II* Clu Zl« Sct Ssr 
1170 11 75 1180 

GAT TCT AAA GAA OAA ACT GAA TCT ATT AAA CAT AAA GAA AAA GAT CTT 3600 
Asp Ssr Vfm Olu Clu Thr Olu S«r lis Lys Asp Lys Clu Lys Asp Val 
1185 1190 1195 1200 

TCA CTA CTT OTT GAA OAA CTT CAA GAC AAT OkT ATC CAT CAA ACT CIT SMS 
S«r Isu Val Val Olu Clu Val Gin Asp Asa Asp Hsc Asp Clu Sar Val 

1205 . 1210.. . 1215 

GAG AAA CTT TXA CAA TTC AAA AAT ATG GAA GAfi GAG TIA ATC AAS CAT 5fi96 
Clu Lys Val lau Olu Lau Lys Asn Mst Glu Clu Clu Lau Mat lys Asp 
1220 1225 1230 

OCT . CTT GAA ATA AAT GAC ATT ACT AGO iUA CTT ATT GAA CM . 3744 

Ala Val Clu Zla Aim Asp Ila Thr Sax Lys Lau Ila Clu Clu Tbr Cln 
1235 1240 1245 

cm: TZA AAT C*A CXA CAA OCA GAT TIA ATA AAA CAT AJtC GAA AAA TTA - 3792 
Clu Lau .Asn Olu Val . Clu Ala Asp Lau Ila Lys Aap HaC Clu Lys Lau 

•.1250 ■ 1255- ■ :i2«ji.- 

AAA GAA TXA 6AA AAA CCA TTA TCA GAA CAT TCT AAA GAA AlA.ATA GAT 3t40 
Lys Glu tsu Clu Xys Ala Lau Smw Clu Aap Sar lOTs Olu XI* Ila Asp 
12« 1270 .1275 1280 

OCA AAA GAT GAT ACA TTA CAA AAA CTT ATT CAA CAC (BlU OiT GAT ATA ■ 3888 
Ala Lya Asp Aap Tlir Lau Clu I^rs Vai Ila Olu Clu Clu His Aap Ila 

.;. uto . 1290 ■•. . "■"•s 

AOG AGO ACQ TTC «SAT CAA CTT OtA CAA. TTA AAA GAT GTC GAA GAA OAC 3936 
"TWr Thr Thr Lau Aap Clu Val Val Glu Lsu Lys Aap Val Olu Glu Aap 
•• -ISOO- .• 1305 IZIO 

AAC ATC GAA AAA CXA TCT GAT TtA AAA CAT CTT GAA GAA GAT ATA TTA 3984 
Lys Ila Olu Lya Vkl Sar Asp Ijbu Lys Asp Lsu Clu Clu Asp Zla Lau 
1315 1320 • 1325 . 

AAA GAA CTA AAA GAA ATC AAA GAA CTT CAA ACT CAA ATT TTA CAA GAT 4032 
Lys Clu Val Lys Clu Ila Lys Clu Lsu Clu Sar Olu He Lsu Clu Asp . 

■ ■ 1330. . 1335. • 1340 

TAT AAA. M TTA AAA ACT ATT CAA ACA CAT ATT TTA GAA GAG AAA AAA 4080 
Tyr Lys Clu Lau Lyc Thr Ila Clu Thtr Asp 11c lau Glu Clu Lys Lys 
X3*S 1350 1355 1360 

CAA ATA CAA AAA GAT CAT TTT GAA AAA TTC CAA CAA CAA OCT CAA GAA 4128 . 

Clu He Clu Lys Asp His Pha Glu Lys Phs Clu Clu Clu Ala Clu Clu ■ 
1365 1370 1375- 
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ATA AAA CAT CTT CAA CCA CAT ATA TIA AAA CAA CTA TCT TCA TTA CAA 
lU lyl A.P U»u Clti Ala AFP !!• Uu Ly« Clu V.l Ser Ser L.u Clu 
1380 

CrrCAAGAACAAAAAAAATTACAAlSAACTACACCAA«AAAAGAACA6 
vS S« Clu Cltt Ly« Ly. Uu Clu Clu Val His Clu IMU Ly* Clu Clu 
1395 1*W> ***** 

CTA CAA CAT ATA AXA ACT CCT OAT CCC CAT AXA AAA CCI TTC JAA CJA 
vS Clu HI. Il« II. ST Gly A.P Al« Hi. U« Cly Clu Clu 
1410. U15 1420 

CAT CAT TTA CAA CAA CTA CAT CAT tlA AAA CCA ACT ATA TTA CAC AlC 
ASP ABP X*u Clu Clu V«l Asp A«p IM* Lys Cly Smx Xl« Uiu A«p Itat 
IA25 1430 >*35 1440 

TTA AAC OCA GAT ATC CAA TZA GGC CAT ATC CAT AAC CAA ACT TTA CAA 
Uni l.y« Cly Asp lUc Clu 1«> Cly Asp lUc Asp t^rs Cl» l-«^C**? 

1445 : 1450, *•» 

CAT CTA ACA ACA AAA CTT OOA CAA ACA CTT OAA TOC TO AAA CAT 
Asp Vsl Thr IHr l.y* U« ciy Clu ats V.l Clu «9T 1m lys A»» V«l 
•- ' 1460 • -lAfiS- . ■ .• 1470 .-. 

TTA TCr ACT CCA TEA COC iiTC CftI CAA CAA CAA AlC AAA ACA.ACA AAA 
L«u Sat Bmr Al« l^u Cly Asp Olfa Clu Gin »•« lys.Thr Ar» Lys 
1475 1400 14>5 

AAA CCT CAA ACA OCT AAC TIC C4A CaA CtA TTA TX4 AAA OAA ^ CIT 
Lys AU <Slt» Ari Pro tys X-u «1« Cl« Clu Clu M 

1490 ••• ■ ■ •■149> . ■" ■ 1509 

AAA CAA CAA CCA AAC AAA AAA ATA ACA AAA AAG AAA CTA AOO TTT CAT 
Lts Clu Clu Pt» 1.ys: iys uys II* thr Lys Lys 1y« Vsl Ar» VUm A«p_ 
1505 . 1510 .. ^'f" 

ATT AAC CAT AAC CAA CCA AAA CAT CAA ATA <^A OAA CIt JJA ATC AAA 
U. Lys Asp Lys Clu Pro Lyi Asp Clu lis V«l Clu V.l Clu H-c Xy« 

1525 ■ 1530 ■ 1.9*» ■ 

GAT CAA OAT ATJi CAA CAA GAT CTA CAA CAA CAT ATA CAA CAA GAT ATA 
Asp Clu Asp il« Clu Clu Asp V«l Clu Clu Asp lis Clu Olu Asp J.U 
1540 1545 1550 

CAA CAA CAT AAA CTt CAA GAT ATA OAT CAA CAT ATA CAT CAA CAT ATA 
Clu Clu Asp Lya Vsl Clu Asp XU A»P Clu Asp lis Asp Clu Asp Il« 
1555 ISW 15«5 

OCT CAA CAC AAA CAT CAA CTT ATA CAT TTA ATA CTC CAA AAA CAC AAA 
Cly Clu Asp Lys Asp Clu V*l He Asp Uiu lie V*l Clu Lys Clu Ljs 
1570 1575 



4176 



4224 



4272 



4320 



4368 



4416 



4464 



4512 



4560 



4600 



4656 



4704 



4752 



14/24 



CCC ATT CAA AAC CTT AAA COC AAA AAG AAA AAA TTA GAA AAA AAA CTT 
Arg Il« Clu Ly« V»l Ly» Al* Ly« Uym Ly» Ly« Lau Clu l.y» Ly« Vel 

1585 1590 1595 1600 

GAA isijL Gat err ACT cgt ctt aaa aaa cac gta cac gaa cta atc aaa 

Clu Clu Cly Val S«r Cly Itfu lym Ly* Mia Vml Asp Glu V«l M«c- Lya 
1605 1410 1415 

TAT GTt GAA AAA ATT CAT AAA CAA CTT GAT AAA GAA CTA TCT AAA CCT 
lyr V«l' dn Ly« Ila. A*p I^s Clu Val Aap Lys Clu Val Sar Lys Ala 
1420 1425 1430 ; 

TTA GAA TCA AAA AAT CAT CTT ACT AAT CTT TtA AAA CAA AAT CAA CAT 
Leu Clu Sar Lys Asn Amp Val Thr Asa Val Lau Lya Gin Aavi Cln Aap 
143S 14*0 14A5 

TTT TTT ACT AAA CTT AAA AAC TTC CIA AAA AAA TAT AAA bXA TTT CCT 
fhm Fha Sa* lya Val lya A«n thm Val tor* lym Tyr Lys Val Rm Ala 
1650. 145S 1440 

CCA CCA TTC AKA TCI GCC CTT <:CA OCA TIT CCA TCA TAT CTA. CTT COC • 
Ala Pro Pha Ila Sar Ala Val Ala Ala fha Ala Sar Tyr Val Val Cly 
1645 1470 i47S :i4«« 

TTC ITT ACA TTT TCt TTA TTT TCA TCA TCT GTA ACA ATA CCT. TCT TCA 
n»a n»a Ths n»« Sar lau IPba Sar Sar Ciya Val Tbr Ila Ala Ssir Sar 
■ . / . " 1485 ; . •: , ■ 1490- . ■ ■ ,l«95 

ACT TAC TTA TTA TCA AAA CTT GAC AAA ACT ATA AAT AAA AAT AAC 
Thr Tyr Lin* Law Sar Lya Val Aap lys Thr Ila Asn l^fa Asn Lys Clu . 
1700. ■ 1705..- 1710 -. 

AGA COC TIT TAt TCA TTT GTA TTT CAT ATC TTT AAG AAT TTA AAA' CAT 
Arr ft6 th* Tyv Ssr Tbm Val Fb« Asp Ila Fb* t.f Am lAu Lys His 
171S 1720 1725 

TAT TTA CAA. CAA ATC. AAA GAA AAA TTT ACT AAA GAA AAA AAT. AAT AAT 
Tyr Lau Cln Cln Hac Lys Clu Ljrs Fba Sar Lys Clu Lys Asn Asn Asn 

1730 1?35 17*0 

GTA ATA CAA CTA ACA AAC AAA CCt CAC AAA AAA CCT aAT CTA GAG CIA 
Val lie Clu Val Thr .Asn Lya Ala Clii Lys Lys dy Asn Val Cln Val 
1745 1750 ,1755 1740 

AbA AAT AAA ACC GAG AAA AOL ACT AAA CTT CAT AAA- AAT AAT AAA GTA 
Thr Asn Lys Thr Clu Lys Thr Thr Lys Val Asp Lys Aan Asii Lys Val 
1745 1770 17?* 

CCC. AAA AAA ACA ACA ACC CAA AAA TCA AAA TAA 
Pro Lys. Lys Arg Arg Thr Cln Lys Ser Lys * 
17B0 1785 



FIQWQK 2/Z 



15/24 

(2) INFORMATION FOR SEQ ZD NO: 3: 
(i) SEQUENCE CHAR^ICTERISTICS : 

(A) LENGTH: 1891 base pairs 

(B) TYPE: nucleotide 

(C) STRANDEDNBSS : double 

(D) TOPOLOGY: linear 
(ii) MOXjECULE TYPE: CDNA 

(ix) FEA'HJRE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2.. 1891 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

T iXA TtK tiCT OlJk i>GI inh m OKt iJa M 

X-« Thr Cl» 5.r VI A.P A«. Xy. A«H^ Clu Ciu Aij 



CAA CAT AlA AAC GAA AAT AlC TTA TIA ACT AAT AXA fSt 
A*p XI. Ly. CI- AMI !!• l*u I*u 8.r A«» »• ^ ^ 

• P> . ■ .■ •; •. •.- ■ ■'• • 

CAA AAi ATT AtT CAC AAT TTA TTA AAT AAT ATT CSA ^ j« «J ^ 
du A« XI* A«p A«k l-w A«a A«» !!• Cly Cln Ajj S«t Clu 

AAA CAA GAA A0r CTA TCA CAA AAT ClA CAA CTC ACT M GAA CTT JPTT 
W SiS Clu sS v3 ST Clu A« Val CU Va «.r A2 l-» «»* 

AAT OAA TTA ITA AAT ACT ClA GAT WT ^ ^ f** Sf^ i5 

SIS S« S« A*n $e« VAl A*p V»l A.« Cly Clu V.l W Cl^ Aw 

•.-65 . .' ■ • ■ 

ATT TTC CAC CAA ACT CAaW AAT CAC CAT^A^ AMT TTA CTA 

II. 1^ Clu Clu S«r Clii VAl A«n A«p Asp llii «»• A^ 

80 " . ;•. w ■ ; ■ •• . 

AAA ACT CTT CAA CAA CAA GAA CAA CAC AAT CIT CAA fjj Jjj 
M S«r V.1 GXti Gin Clu Cln cm HI. A.n V.l Clu Clu Uy« VmX Clu 

CAA ACT CTA CAA CAA AAT CAC CAA CAA ACT, CTA JJ* J^J ™ Si 
Clu S-r V.l Clu gU A«n A.p Clu Clu Smr v»l C^u Clu A«n VaI Clu 
1X5 120 



GAA AAT CTA GAA CAA AAT ^A^ «^ I" tf! ^1 SJS 

Clu A.« Val Clu Clu Asn Asp Asp Ply Scr V«l Ala S«r Scr VaI Clu 

130 135 
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CAA ACT ATA CCT TCA ACT CTT GAT CAA ACT ATA GAT TCA ACT ATT GAA ^78 
Glu Sar Il« Ala 5«r Sar Val Aap Clu Sar Ila A«p Smr Smt Xla Glu 

IAS 150 155 

GAA AAT GTA OCT CCA ACT CTT GAA CAA ATC GTA GCT CCA ACT CTT CAA 526 
Glu ASB Val Ala Pre Thr Val Clu Clu Ila Val Aim Txtt Thr V«l Clu 
160 1«5 170 175 

GAA ATT CTA CCT CCA ACT CTT CTA CAA ACT CTC CCT CCA ACT CTT CAA 574 
Clu 11* Val Ala Pro Sar Val Val Clu Scr Val Ala Fro Sar Val Clu 
180 ; 185 190 

CAA ACT CTA CCT CCA ACT CTT CAA CAA ACT CTA GCT CAA AAT CTT GAA 622 
Glu Sar Val Ala Pra 6ar Val Clu Glu Sar Val Ala Glu A*n Val Clu 

195 200 . . 205 

GAA ACT GTA CCT CAA AAT CTT CAA CAA AXC ClA CCt CCA ACT CTT CAA €70 
Clu Sar Val Ala Clu Aaa Val Glu Clu tla Val Ala Pro Sar Val Glu 

210 215 220 . 

CAA ACT CTA CCT GAA AAT CTT GAA GAA ACT GTA CCT GAA AAT CTT GAA ' 718 
Clu Sar val Ala Clu A»a Val Glu Clu Sar Val Ala Glu A«u Val Glu 

225 • . •,. 230 235 . •' 

CAA ACT CTA COT GAA AAT CTT GAA GAA ACT GZA GCT CAA AAT .CTT GAA 78$ 
Clu Sar Val Ala Glu Asa Val Clu Glu Sar Val Ala Clu Asn Val Clu 

240 . 245 . 250 255 

CAA ACT CTA CCT GAA AAT CTT GAA CAA ATC GCT CCA ACT CTT. (BAA 814 

Clu Sar Val Ala Clu Aatt V«l clu Clu Ila Val aI* Pro Thr Val Clu 

260' :2«S. .' ■• , 270 

CAA ACT CTA o6t CCA ACT Ctt CAA CAA ATT CTA GCT CCiA ACT CTT GAA iB62 
Clu Sar Val Ala Pre Thr Val Clu Glu Ila Val Ala Pro Thr Val Clu 

: 275 ■ 280. . 285 . . • 

GAA ACT CTA CCT CCA ACT CTT GAA GAA ATT CIA CTT CCA ACT CTT CAA 910 
Clu Sar VAl Aia Pre Thr Val Clu Clu Ila Val Val Pro Sar Val Qlia 

..290 ■. 295 -300 ■ • • ■■ 

GAA ACT CTA CCT CCA ACT CTT GAA GAA ACT CTA CCT GAA AAt CTT GAA 958 
Clu Sar Val Ala Pre Sar Val Clu Clu Sar Val Ala Glu Aan Val Glu 

305 .310 315 

GAA ACT CTA OCT GAA AAT CTT CAA CAA ACT CTA CCT GAA AAT CTT GAA 1006 
Clu Sar Val Ala Clu Aan Val Clu Clu Sar Val Ala Clu Aan V^l Clu 

320 325, 330. 

CAA ACT CTA CCT CAA AAT CTT CAA CAA ACT CTA OCT CAA AAT CTT CAA 1054 
Clu Scr Val Ala Clu Asn Val Clu clu Ser Val Ala Clu Asn V«l Clu 
340 34 5 350 
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CAA ATC CTA CCT CCA ACT CTt CAA CAA ATC CTA CCT CCA ACT CTI CAA 
Glu 11« V*X Al* Fro S«r V*X Olu Glu II* Val Al« ftro THr V«l Clu 
355 .360 365 

CAA ACT CTT CCT CAA AAC CTT CCA ACA AAT TTA ICA GAC AAT CIT TTA 
Clu Sttr V»l Ala Clu A»ti VaI AIa Thr A«n L»u S*r *«p A«n L»« L«u 
370 3>5 310 . 

ACT AAT TTA TTA CCT CCt ATC CAA ACT CAC CAA ASA AAG CiiC ACT 
S«r Asn L*u Leu Gly Cly il« Clu Thr Clu Glu tltt lys Asp S«r Iltt 

• 385 , '.390 395 

TTA AAT CAC ATA CAA GAA CTA AAA CAA AAT CTA CIC ACC ACA ATA CIA 
l«tt A«n Clu Ii« Clu Clu V*l lya Clu Asn ^*1 VmX Thr Thr Jl» tmn 
AOO ACfS 410 

CAA AAA CTA CAA CAA 4CT ACA CCT GAA ACT . CTA ACT ACT TTT ACT AAT 
Glu Ly« V*l Glu Ciu Ttar Thr AIa Clu Smr Vml Thr Thir ffc* »«r Am 

•. . "420 .-. .42* • . ♦^O • 

ATA TTA CAC GAC ATA CAA CAA AAT ACT ATT ACT AAT CAT ACT AtA CAC 
Il« lAU du Clu Xl« Clu Clu Ass Thr Il« Thr A»» Aaip Thr Xl« Clu 

• 435 • "440 

dAA AAA TtA GAA GAA CtC CAC CAA AAT CTA TTA ACT CCC OCT .TTA CAA 
Clu LM Lbu Clu Clu l»u MUi Glu Am V«I Lwii S«r .Al« Al«. l«u Clu 

••. 450--.., • 455 460-. 

AAT AOC CAA ACT CAA CAC CAA AAC AAA CAA cicA ATA CAT CTA ATT GAA 
Am Thr ClA S*r Clu Cl» Clu Ly* Lya CJu VaI XI* A«p V«l H« Clu. 

GAA CTA AAA CAA GAG CTC GCT ACC ACT .TIA ATA CAA ACT CTC CAA CAC 
Olu V«l Lys Clu ciu VaI Ala Thr Thy 1^ !!• Clu Thr V»l Clu da 
480 485 490 ^ ^ 

CCA GAA GAA GAC ACC GAA ACT ACA ATT AOC GAA ATA TTT CAA AAT TtA 
Ala Cltt Clu Clu .Sar. ciw S*ir Thr 11a Thr Glw Xlu Clu Am l*u 

.500-;' ■ ■ 505'.-. - . • : 

CAA CAA AAT CCA CTA GAA AjST AAT CAA AAA CTT CCA CAC AAT TTA CAC 
Glu Clu Asn Ala Vat Clu Sar Asn Clu Lys Val Ala Clu Asn >au Glu 

• . ■ -515 . 520 . . ■ 525 • 

AAA TTA AAC GAA ACt CTA TTT AAT ACT CTA TTA CAT AAA CTA CAC CAA 
Ly« IJtu Asn Clu Thr Val Fh* Aso Thr Val Lm Asp .Lys Val Clu Clu 

• 'SJO; ••• -535 :•• 5*0 

ACA CTA CAA ATT ACC CGA CAA ACT TTA CAA AAC AAT CAA ATC CAT AAA 
Thr Val Clu He Sar Cly Clu Sar Leu Clu Asn Asn Clu Met Asp Lys 
545 550 555 
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CCA TTT TTT ACT CAA ATA TTT OAT AXT Ct A AAA CCA ATA CAA CAA AAT 
AU Ph« th» imr Clu Il« Ph» A«p Am V»X Ly« Gly lie Clti Cl« A»n 
560 565 570 

TTA TTA AGA cot ATC TTT CCA ACT ATA CAA ACC ACT ATA CTA ATC CAA 
Uu Leu Thr Cly Ph» Arg XI* Clu Ttir S«r !!• V«l Xl« Cln 
580 5«5 590 

TCA CAA GAA AAC CTT CAT TIC AAT CAA AAT CTC CTT ACT TOC ATT TtA 
Ser CXii Clu Ly* Val Asp i«u Asa Clu A»n VaX Vml S«r Scr II* l«u 
595 . 600 605 

CAT AAT ATA CAA AAT ATC AAA. CAA CCT TTA TTA AAT AAA TTA CAA AAT 
Asp A«Ti Il« CXw AMtk Itec Ly» Clu Cly L»tt Lbu Asa Vym l*u Clu A«i 
610 615 fiO 

ATT TCA ACT ACT CAA GGC CAA 
XI* Sar S«r Thx clu Cly Clu 

630 . .• . 
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